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Although  much  attention  has  been  given  to  the  problem  of 
estimating  variance  components  in  unbalanced  random  and  mixed  linear 
models,  the  same  cannot  be  said  about  the  development  of  exact  testing 
procedures  for  the  aforementioned  quantities.   One  reason  for  this  is 
the  mathematical  complexity  of  the  problem.   Even  in  some  balanced 
random  or  mixed  models,  no  exact  analysis  of  variance  tests  exist  for 
testing  certain  variance  components.   Consequently,  approximate 
methods,  such  as  Satterthwaite's  procedure,  are  typically  relied  upon 
to  furnish  the  required  tests.   Unfortunately,  the  exact  distributions 
of  these  approximate  procedures  are  generally  unknown,  even  under  the 
null  hypothesis. 
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Recently,  some  progress  has  been  made  in  the  construction  of 
exact  tests  for  the  variance  components  and  for  estimable  linear 
functions  of  the  fixed-effects  parameters  in  unbalanced  random  and 
mixed  models.   In  this  dissertation  we  extend  some  of  these  known 
results  and  generate  some  new  ones.   Specifically,  exact  procedures  are 
developed  to  test  hypotheses  concerning  the  variance  components  in  an 
unbalanced  random  two-way  model  with  some  cells  missing,  an  unbalanced 
random  3-fold  nested  model,  and  an  unbalanced  random  s-fold  nested 
model  where  the  imbalance  affects  the  last  two  stages  only.   Also, 
methods  are  derived  to  test  hypotheses  about  the  variance  components 
and  about  estimable  linear  functions  of  the  fixed-effects  parameters  in 
a  general  unbalanced  mixed  model  with  imbalance  affecting  the  last 
stage  only. 
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CHAPTER  ONE 
INTRODUCTION 

! 

Although  much  attention  has  been  given  to  the  problem  of 
estimating  variance  components  (see,  e.g.,  Rao  and  Kleffe  (1988)), 
the  same  cannot  be  said  about  the  development  of  exact  testing 
procedures  concerning  the  variance  components  in  unbalanced  random 
and  mixed  linear  models.   Perhaps  one  reason  for  this  is  due  to  the 
mathematical  complexity  of  the  problem.   Even  for  some  balanced 
models  no  exact  analysis  of  variance  (ANOVA)  tests  exist  for  testing 
certain  variance  components.   For  example,  in  a  three-way  cross- 
classification  random-effects  model,  no  exact  procedures  exist  for 
testing  hypotheses  concerning  the  variance  components  associated  with 
the  main  effects  of  the  model.   Also,  when  exact  tests  do  exist  for 
balanced  models,  they  generally  cannot  be  used  to  test  the 
corresponding  hypotheses  in  unbalanced  models.   The  reason  for  this 
is  that  for  unbalanced  models,  the  traditional  test  statistics  used 
with  balanced  data  are,  in  general,  no  longer  based  upon  independent, 
chi-squared-type  sums  of  squares. 

Approximate  methods,  such  as  Satterthwaite' s  (1946)  procedure, 
are  typically  relied  upon  to  furnish  tests  for  the  variance 
components  when  no  such  exact  procedure  exists.   However,  approximate 
methods  are  just  that  --  approximate.   Some  may  be  relatively  easy  to 
implement,  and  perform  reasonably  well  in  certain  situations,  but 
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they  cannot  be  expected  to  always  produce  adequate  solutions  (see, 
e.g.,  Cummings  and  Gaylor  (1974),  Tietjen  (1974),  and  more  recently, 
Ames  and  Webster  (1991)).   Thus  there  is  a  clear  need  for  the 
development  of  exact  testing  procedures. 

In  this  dissertation  we  have  concentrated  on  developing  exact 
methodology  for  some  unbalanced  random  and  mixed  linear  models.   Much 
of  this  work  has  been  inspired  by  the  derivations  given  in  Khuri 
(1984,  1987,  1990),  Khuri  and  Littell  (1987),  Gallo  (1987),  and  Gallo 
and  Khuri  (1990).   Their  results  and  those  obtained  in  this 
dissertation  are  based  on  certain  transformations  which  reduce  the 
analysis  of  the  unbalanced  model  to  that  of  a  balanced  one.   A 

crucial  theme  in  the  construction  of  these  transformations  is  a 

I 

1 

scheme  for  resampling  from  the  error  vector.   In  this  scheme,  only  a 
portion  of  the  error  vector  is  utilized  in  making  the  transformation. 
Consequently,  the  values  of  the  resulting  test  statistics  will  not  be 
unique.   However,  the  distributional  properties  of  these  statistics 
will  remain  unchanged  no  matter  what  portion  of  the  error  vector  is 
used  in  the  transformation.   It  is  in  this  sense  that  our  methodology 
is  similar  to  procedures  involving  resampling.   Appendix  C  provides 
an  illustration  of  the  utility  of  resampling  as  a  means  of 
"counteracting"  the  imbalance  in  a  design. 

This  thesis  is  divided  into  four  chapters.   Chapters  one  and  two 
give  a  brief  introduction  to  the  area  of  exact  testing  procedures  in 
unbalanced  random  and  mixed  linear  models  and  a  review  of  the 
pertinent  literature.   The  main  contributions  of  this  study  are  found 
in  Chapter  three.   This  chapter  is  divided  into  four  sections.   In 


the  first  section  we  develop  exact  tests  for  the  variance  components 
in  a  random  two-way  model  with  some  cells  missing.   Section  two  deals 
with  the  unbalanced  random  3-fold  nested  model.   Here  the  imbalance 
is  assumed  to  affect  all  stages  of  the  design.   In  the  last  two 
sections  we  concentrate  on  more  general  models.   The  general  mixed 
model  with  imbalance  affecting  the  last  stage  only  is  analyzed  in 
Section  three.   While  in  Section  four  we  obtain  exact  procedures   for 
testing  the  variance  components  in  a  random  s-fold  nested  model  with 
imbalance  affecting  the  last  two  stages  only.  Finally,  Chapter  four 
summarizes  our  findings  and  offers  directions  for  future  research. 

Throughout  this  work  we  will  try  to  motivate,  as  much  as 
possible,  the  reason  behind  each  transformation  we  use.   At  times, 
the  mathematical  derivations  may  become  "intense"  in  the  sense  that 
there  may  be  pages  containing  nothing  but  equations.   Unfortunately, 
there  seems  to  be  no  way  to  avoid  this.  A  numerical  example  is  given 
at  the  end  of  each  section  in  Chapter  three  to  illustrate  the 
proposed  methodology.   These  examples  also  serve  as  useful  (and 
probably  needed)  "rest  stops"  between  pages  full  of  formulas. 

The  following  notation  will  be  used  throughout  this  thesis: 

•  rank(A)  will  denote  the  rank  of  the  matrix  A 

•  range(A)  will  denote  the  range  space  of  the  matrix  A,  that 
is,  the  vector  space  spanned  by  the  columns  of  A. 

•  tr(A)  will  denote  the  trace  of  the  matrix  A. 

•  A"  will  denote  a  generalized  inverse  of  A,  that  is,  any 
matrix  which  satisfies  AA~A  =  A. 


A  0 
0  B 


This  can  also  be 


•  diag(A,  B)  will  denote  the  matrix 
represented  by  AeB  where  "0"  is  the  direct  sura  operator. 

•  A(8)B  will  be  used  to  represent  the  (right)  direct  (Kronecker) 
product  of  two  matrices.   Briefly,  if  A  is  n x m  and  B  is  pxq 


then  A(8)B  is  the  npxmq  partitioned  matrix  whose  (i,j) 


th 


i\th 


matrix  is  a^ -B  where  a^-  is  the  (i,j)   element  of  A 


CHAPTER  TWO 
LITERATURE  REVIEW 


The  first  paper  that  addresses  the  issue  of  exact  tests  for 
variance  components  in  unbalanced  models  is  due  to  Wald  (1940).   Wald 
constructed  an  exact  test  for  (j\l (y\    in  the  random  one-way  model: 


y.j  =  ;.  +  a.+eij 


(2.1) 


(i  =  1,2,  •  •  •  ,p;  j  =  1,2,  •  •  •  ,m- )  ,  where  /x  is  an  unknown  constant 
parameter;  a-  and  e-  •  are  independent,  normally  distributed  random 
variables  with  zero  means  and  variances  c^  and  a\,    respectively.   He 
later  extended  this  result  to  the  general  random-effects  model 
without  interactions.   Wald  (1947)  also  considered  the  mixed  model 


y  =  Xg  +  Zb  +  f 


(2.2) 


where  g  is  a  vector  of  fixed  effects;  b~N(0,  <r^I)  independently  of 
e~N(0,  <T^I).   Using  a  regression  analysis  approach,  Wald  was  able  to 
construct  an  exact  test  for  <^\^/<^c 

As  pointed  out  in  Spj0tvoll  (1968),  Wald's  idea  cannot  be  used  to 
test  variance  components  in  all  linear  models.   To  illustrate  this 
phenomenon,  Spj0tvoll  considered  the  random  two-way  model.   Based  on 


a  modification  of  Wald's  method  he  was  able  to  derive  an  exact  test 
for  the  variance  component  associated  with  the  interaction  effect. 
However,  only  under  the  assumption  of  additivity  could  he  construct 
exact  tests  for  the  main  effects  variance  components. 

By  modifying  an  approach  suggested  by  Graybill  and  Hultquist 
(1961),  Thomsen  (1975)  was  able  to  construct  exact  tests  for  the 
variance  components  in  an  unbalanced  two-way  random  model.   Thomsen 
did  this  by  first  making  an  orthogonal  transformation  as  outlined 
below.   Let  y  be  the  vector  of  cell  means,  then  in  matrix  notation  we 
have  (see  Thomsen  (1975,  p.  258)): 

y  =  /ilr-s  +  Big  +  B2^+Irs(^)+I  (2-3) 

where  Ij-g  is  the  vector  of  ones  of  dimension  rs;  Ij-g  is  the  rs  x  rs 
identity  matrix;  Bj  and  Bj  are  defined  as 

Bi  =  Ir®ls 

(2.4) 

B2  =  lr®Is 

and  r  is  the  number  of  rows  and  s  is  the  number  of  columns  in  the 
design.   Since  Aj =  B^B/  and  Aj = BjBj'  commute,  there  exists  an 
orthogonal  matrix  P  which  simultaneously  diagonalizes  Aj  and  A2. 
Further,  the  first  row  of  P  may  be  chosen  to  be  (l/^(rs)lps•   Making 
the  transformation 
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z  =  Py  (2.5) 

results  in  Thomsen's  "semi-canonical"  form.   Thomsen  used  the  last 
rs-l  elements  of  z  to  derive  exact  tests  for  cr^^  (the  variance 
component  associated  with  the  interaction  effect)  and  for  a^^   and  a ^ 
(the  main  effects  variance  components).   However,  as  in  Spj0tvoll 
(1968),  the  development  of  the  latter  two  tests  rested  on  the 
assumption  that  o-^^  =  0.   Also,  Thomsen  assumed  the  layout  contained 
no  missing  cells.   (In  a  later  section  he  dropped  this  assumption, 
but  still  assumed  the  design  was  connected.) 

In  1983,  Seely  and  El-Bassiouni  returned  to  Vald's  variance 
component  test.   In  their  paper,  necessary  and  sufficient  conditions 
were  given  under  which  the  Vald  test  can  be  used  in  mixed  models. 
They  also  developed  a  uniqueness  property  of  Vald's  test.   Thus  based 
solely  on  matrix  ranks,  one  can  determine  if  a  particular  variance 
component  test  is  a  Vald's  test.   They  applied  their  notion  to  the 
tests  developed  by  Spj0tvoll  (1968)  and  Thomsen  (1975).   They 
concluded  that  the  tests  derived  by  Thomsen  are  indeed  Vald's  tests, 
while  only  the  test  for  interaction  developed  by  Spj0tvoll  is  a 
Vald's  test. 

Although  both  Spj0tvoll  and  Thomsen  needed  to  assume  additivity  in 
their  development  of  exact  tests  for  the  main  effects  variance 
components  in  an  unbalanced  random  two-way  model,  Khuri  and  Littell 
(1987)  demonstrated  that  this  assumption  was  unnecessary.   Initially, 
their  work  proceeded  similarly  to  Thomsen  (1975).   However,  they  also 
made  a  particular  transformation  that  reduced  the  analysis  of  the 


unbalanced  model  to  that  of  a  balanced  one.   Their  idea  was  based  on 
a  scheme  for  resampling  from  the  error  vector.   After  making  this 
transformation  they  were  left  with  a  model  of  the  form  ^see  Khuri  and 
Littell  (1987,  p.  548)), 

w  =  PiBig  +  PiB2^  +  €*  (2.6) 

where  g~N(0,  ^^Ir);  ^~N(0,  (r^Ig);  £*~N(0,  n^-s-i)  where  <5  =  <rj^^ 
+  ^max  '^l   ^"^  ^max  ^^  the  largest  eigenvalue  of  the  matrix  PjKPi'  where 
K  =  diag(nll^nl2^•  •  •,n;^s)  and  Pj  consists  of  the  last  rs-1  rows  of  P 
(see  (2.5)).   Furthermore,  g,  ^  and  e*   are  mutually  independent.   B^ 
and  B2  are  defined  in  (2.4). 

After  partitioning  w  into  w=(wj^,  oj'p,    v'apY y    Khuri  and  Littell 
developed  exact  tests  for  cr^  and  (T^  based  on  (w^,  w^^)'  and  (w^, 
w'  a)',    respectively.   In  particular,  their  test  statistic  for  testing 
Ho:«r^  =  0  is  given  by 


P       '±'Q'±'Q/(r-l)  (2.7) 

'~y:,^-,^/[(r-l)(s-l)]' 


It   should    be   noted   that 


rank(PiBi :  PjBj)  =  r  +  s  -  2 


rank(PiBi)  =r-l 


rank(PiB2)  =  s-1, 


Consequently,  according  to  Seely  and  El-Bassiouni  (1983),  the  tests 
derived  by  Khuri  and  Littell  (1987)  for  the  main  effects  variance 
components  are  Vald's  tests  based  on  the  w  model  in  (2.6). 

One  attractive  feature  of  a  Wald's  test  is  that  when  applied  in  an 
unbalanced  one-way-random  effects  model,  it  may  be  interpreted  as 
being  "almost  equal  to  the  most  powerful  invariant  tests  against 
large  alternatives"  (Spj0tvoll  (1967,  p. 422)).   This  result  is  also 
hinted  at  in  Mostafa  (1967).   Also,  under  certain  conditions,  Mathew 
and  Sinha  (1988)  show  that  the  Vald  test  for  testing  the  variance 
component  associated  with  the  unbalanced  random  two-way  mixed  model 
without  interaction  is  the  uniformly  most  powerful  invariant  test 
under  a  certain  group  of  transformations.   This  is  more  fully 
discussed  in  chapter  four. 

Besides  the  above  mentioned  results,  other  exact  procedures  do 
exist  in  the  literature.   Khuri  (1987)  developed  an  exact  test  for 
the  nesting  effect's  variance  component  in  an  unbalanced  random  2- 
fold  nested  model.   The  idea  of  resampling  from  the  error  vector  was 
also  utilized  in  this  paper.   Verdooren  (1988)  also  analyzed  the 
unbalanced  random  2-fold  nested  model.   He  was  able  to  derive  an 
exact  test  for  p^   for  each  given  value  of  P2-      Here  p^    is  the  ratio  of 
the  nesting  effect's  variance  component  to  the  error  variance 
component  and  P2    is  the  ratio  of  the  nested  effect's  variance 
component  to  the  error  variance  component. 

Exact  tests  for  the  random  and  fixed  effects  in  some  unbalanced 
mixed  models  were  derived  in  Gallo  (1987)  and  Gallo  and  Khuri  (1990). 
Hocking  (1988)  also  considered  the  mixed  linear  model.   Although  he 
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was  more  interested  in  estimation  than  in  testing,  he  did  note  that 
his  estimators  could  be  used  to  develop  tests  of  hypotheses  for 
designs  with  moderate  imbalance.   These  tests,  however,  are  not 
exact.   Ve  only  mention  them  since  they  may  represent  reasonable 
alternatives  to  the  methodology  presented  in  Section  three  of  Chapter 
three. 

Turning  to  more  general  models,  Khuri  (1990)  developed  exact  tests 
for  random  models  with  unequal  cell  frequencies  in  the  last  stage. 
These  designs  include  all  cross-classification  random  effects  models 
with  no  missing  cells.   Utilizing  resampling,  Khuri  was  able  to 
construct  a  set  of  mutually  independent  sums  of  squares  that  are 
distributed  as  scaled  chi-squared  variates.   These  sums  of  squares 
were  then  used  to  test  hypotheses  concerning  the  variance  components. 
It  should  be  noted  that  not  all  variance  components  can  be  tested 
exactly  using  Khuri 's  procedure.   This  should  not  be  too  surprising, 
however,  since  this  phenomenon  occurs  even  for  some  balanced  random 
(or  mixed)  models  (cf.  Kendall  and  Stuart  (1968),  p.  83). 

Using  a  different  approach,  Ofversten  (1989)  was  also  able  to 
construct  exact  tests  for  the  variance  components  in  general  mixed 
linear  models.   His  idea  was  to  use  a  result  found  in  Allen  (1974)  to 
orthogonally  transform  the  model  matrices  into  so-called  layer 
triangular  form.   The  nice  thing  about  this  type  of  transformation  is 
that  when  combined  with  orthogonal  row  permutations  it  can  be  used  to 
reduce  matrices  to  ones  with  full  row  rank.   This  technique  is 
repeatedly  utilized  in  Ofversten's  work  to  produce  exact  tests  for 
the  variance  components  in  certain  mixed  models.   However,  when 
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certain  rank  conditions  are  not  satisfied  this  approach  cannot  be 
used  to  produce  the  exact  procedures.   When  this  occurs,  Ofversten 
relies  on  resampling  to  develop  his  exact  tests.   Ofversten 's 
methodology  can  be  used  in  any  unbalanced  mixed  model  to  construct 
tests  for  the  variance  components.   However,  it  is  not  altogether 
clear  as  to  how  to  adapt  his  methods  to  designs  not  examined  in  his 
dissertation.  More  is  said  about  Ofversten's  results  in  Chapter  four. 

Partial  motivation  behind  the  need  to  develop  exact  testing 
procedures  can  be  found  in  Tietjen  (1974)  and  Cummings  and  Gaylor 
(1974).   Both  of  these  papers  considered  the  random  2-fold  nested 
model.   In  his  simulation  study,  Tietjen  showed  that  the  approximate 
test  based  on  Satterthwaite's  (1946)  procedure  for  testing  the 
nesting  effect's  variance  component  can  be  markedly  off  in 
approximating  the  true  p-value.   This  led  him  to  recommend  the  use  of 
the  conventional  F-test  (that  is,  the  test  used  when  the  data  are 
balanced)  as  an  approximate  test  for  this  hypothesis. 

Cummings  and  Gaylor  (1974)  considered  various  2-fold  nested 
designs  in  their  simulations.   These  designs  were  specifically  chosen 
so  that  the  mean  squares  involved  in  the  approximate  (Satterthwaite) 
F-test  were  either  dependent,  failed  to  be  distributed  as  scaled  chi- 
squared  variates,  or  both.   They  demonstrated  that  when  the  mean 
squares  were  dependent,  the  estimated  test  size  was  less  than  the 
stated  test  size  and  thus,  the  approximate  test  was  conservative. 
When  the  mean  squares  were  independent  but  did  not  have  chi-squared 
type  distributions,  the  opposite  occurred.   The  estimated  test  size 
was  greater  than  the  stated  test  size.   When  the  mean  squares  were 
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both  dependent  and  not  distributed  as  scaled  chi-squared  variates,  a 
"counter-balancing"  effect  was  observed.   The  approximate  F-test  in 
this  case  seemed  to  perform  well.   However,  this  counteracting  effect 
may  not  occur  in  higher  level  designs.   As  Cummings  and  Gaylor  (1974, 
p.  771)  state  "there  is  a  possibility  the  test  size  disturbances  due 
to  mean  square  dependence  and  nonchisquaredness  will  provide  some 
reinforcement . " 

In  the  next  chapter  we  extend  the  work  of  Khuri  (1987,  1990), 
Khuri  and  Littell  (1987),  and  Gallo  and  Khuri  (1990).   The  idea  of 
resampling  from  the  error  vector  is  vital  in  our  development  and  is 
used  repeatedly  throughout  this  dissertation.   The  reader  is  referred 
to  Appendix  C  where  the  utility  of  this  approach  is  illustrated. 


CHAPTER  THREE 
EXACT  TESTING  PROCEDURES 


3.1.  The  Unbalanced  Random  Two-Vay  Model  with  Some  Cells  Missing 

3.1.1.   Introduction 

In  this  section  we  will  consider  the  unbalanced  random  two-way 
model  with  some  cells  missing.   Specifically,  we  will  concentrate  on 
the  following  model: 

yijk  =  /^  +  «i+/?j  +  («^)ij  +  ^ijk  (2-^) 

((ijj)^^;  k  =  1,2,  •  •  •  ,n-  •);  where  fi   is  an  unknown  constant  parameter; 
a-,    /?■,  (a/?)-  •,  and  (■   •.  are  independently  distributed  normal  random 

2     2     2        J   2 

variables  with  zero  means  and  variances  a^^,    cr^,    o-^g^    and  cr^, 
respectively;  f  is  the  set  of  subscripts,  (i,j),  which  label  the 
nonempty  cells  of  (3.1).   That  is, 

^={(i,j):   l<i<r,  l<j<s,  n.j>0}  (3.2) 

where  r  is  the  number  of  rows  and  s  the  number  of  columns  of  the 

design.   Note  that  S"  is  a  proper  subset  of  IXJ,  where  1  = 

{i:i = 1,2, • • • ,r};  J = { j : j = 1 ,2, • • • ,s} ;  and  X  denotes  the  Cartesian 

product. 
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In  matrix  form,  (3.1)  becomes 

y  =  /iln  +  Xig  +  X2^  +  X3(a^)+£  (3.3) 

where  v  is  the  vector  of  observations  of  dimension  n=   X)     "ii!  in 

-  (i,j)e:f 

is  a  vector  of  ones  of  dimension  n;  X^,  Xj,  and  X3  are  known  matrices 
of  zeros  and  ones  of  orders  nxr,  nxs,  and  nx(rs-p),  respectively, 
where  p  is  the  number  of  empty  cells;  a,    l3,    and  {a£)    are  column 
vectors  whose  elements  are  the  a^,    Py    and  (o/3)jj,  respectively 
((i,j)G^));  €  is  the  nxl  vector  of  errors.   Define 

V.  -=-3-  r  y-  -I  (3-4) 

for  all  (i,j)e^.   From  (3.1)  we  obtain 

yij  =  /i  +  ai+;9j  +  (a^)ij  +  fij  (3-5) 

((i,J)Gf),  where 

e.  .=^-  y  c   .,  (3.6) 

for  all  (i,j)e^.   In  matrix  form,  (3.5)  becomes 

y  =  /'lrs-p  +  Aig  +  A2^+Irs-p('^)+I  ^^•'^^ 

where  y  is  the  (rs-p)xl  vector  of  (filled)  cell  means;  Ij-s-p  i^  ^^^ 
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identity  matrix  of  order  rs-p;  Aj  and  Aj  are  known  matrices  of  zeros 
and  ones  of  orders  (rs-p)xr  and  (rs-p)xs,  respectively.   A^  and  A2 
can  be  expressed  as 


-'^1  ~-®^-S-p£ 


(3.8) 


A2  = 


(Is-pi=  0(s-pi)xpi)^i 

(Is-P2=  0(S-P2)XP2)^2 


(Is-p^=  ^s-p^)xpr)Er 


(3.9) 


where  p-  is  the  number  of  empty  cells  in  row  i  (i  —  1 ,2,  •  • • ,r) ; 

0.     N     is  a  matrix  of  zeros  of  order  (s-p-)xp-  (i  =  l,2,---,r); 
(s-pj)xp.  1    1 

and  E-  is  a  product  of  interchange  matrices  (i  =  1 ,2,  •  •  •  ,r) .   We  take 
(Is-p.:  0(3_p.)^p.)  =  Is  when  p^  =  0  ( i  =  1 ,2,  •  •  •  ,r)  . 


Example  3.1 


Consider  the  following  two-way  table,  an  "X"  represents  a  filled 


cell, 


X  X  X  X 
X  X~~  X 
X   X 
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Then, 


furthermore, 


r  =  3 

s=4 
Pi=0 

P2  =  1 

p  =  3 


Ej  =  I4  -  E3 


consequently, 


£2  = 


1 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

1 

0 

0 

0 

0 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

1 

0 

0 

0 

0 

1 

0 

1 

0 

0 

Aj  =  diag(l4, 13,12) 
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A2  = 


10  0  0 
0  0  10 
0  0  0  1 


(12-^2x2) 


Besides  the  usual  assumptions  concerning  the  random  effects,  we  will 
also  assume: 


(i)    The  model  used  is  model  (3.1)  (or  model  (3.3)) 

(ii)   Both  r  and  s  are  at  least  two 

(iii)  rank(Ai:A2)=r  +  s-l  (3.10) 

(iv)   p<(r-l)(s-l) 

(v)    n  >2(rs-p) -min(r,s) 

Assumption  (iii)  implies  that  the  design  is  connected  ^Gately  (1962) 
p.  46),  while  the  last  assumption  (along  with  the  others)  is  needed  to 
insure  that  the  exact  tests  for  the  main  effects'  variance  components 

are  valid. 

The  inclusion  of  missing  cells  into  any  design  adds  a  degree  of 
difficulty  to  the  analysis.   The  two-way  random  model  is  no 
exception.   One  problem  is  that  the  matrices  B^  =  A^Aj  and  Bj  =  AjAj  do 
not,  in  general,  commute.   B^  and  B2  will  commute  if  all  cells  are 
filled  (see  Thomsen  (1975)  or  Khuri  and  Littell  (1987)j. 
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Reconsidering  Example  3.1,  we  have 


where  Ja=lalk-   ^hile 


Bi  =  diag(J4,  J3,  Jj) 


62  = 


(I3=Q)E2 
(l2=0) 


=^o') 


(I2^o)e;(^;) 


hence 


B1B2- 


J3(l3'-Q)E2 


J3  J3(l3  =  Q)E2(-0-) 


J2(l2=0)     J2(l2=0)E2  (q/)        h 


Since,  for  example,  J4(o')  =  (j^)  ^^ile  J2(l2:0)  =  ( ^2  =  0)  ,  we  see  that  B1B2 
is  not  symmetric.   Thus  Bj  and  B2  do  not  commute.  Consequently  no 
orthogonal  matrix  exists  that  will  simultaneously  diagonalize  Bj  and 


B2. 
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The  next  section  lays  the  groundwork  necessary  for  the 
construction  of  the  exact  tests. 

3.1.2    Preliminary  Development 

Recall  (3.7).   From  the  distributional  properties  of  the  random 
effects,  we  can  conclude  that 


y~N(/ilj.s-p,  S) 


where 


E  =  Bi4  +  B2^^  +  Irs-p'^a/?  +  ^'^'  (^-^^^ 


and 


K==    e     njl.  (3.12) 

(i,j)€:f  'J 


Bi  =  AiAl 


^2  —  A2A2 


The  following  lemma  provides  the  ranks  of  B^  and  B2. 


(3.13) 
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Lemma  3 . 1 


(a)   rank(Bi)  =  r 


(b)   rank(B2)  =  s 


proof: 


(a)   rank(Bi)  =rank(AiAi)  =rank(AJ.   Since 


rank(Ai)  =(©  l^.p.)    (see  (3.8)) 
1  =  1   '^1  ^ 


=  y^(l)=r,  the  result  follows, 
i  =  l 


(b)   Without  loss  of  generality,  we  can  assume  that  each  column  (and 
row)  in  the  layout  includes  at  least  one  cell  containing 
observations.   Thus,  each  column  in  Aj  contains  at  least  one  "1" 
Hence  there  exists  a  permutation  matrix  H  such  that 


HA-, 


^s 


for  some  (rs-p-s)xs  matrix  $.   Consequently, 


rank(B2)  =  rank(A2A2) 


=  rank(A2) 
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=  rank(nA2) 


rank 


$ 


D 


Now,  let 


u  =  Hiy 


(3.14) 


where  Hj  consists  of  the  last  rs-p-1  rows  of  the  (rs  -  p)  x  (rs  -  p) 
Helmert  matrix  (see   Searle  (1982),  p.  7lj.   Ve  note  that 
Hilrs-p^Qrs-p-l*   Then,  it  is  easily  verified  that 


E(u)=Q^3_p_l 


i/_2 


Var(u)  =  U.B.ny^  +  n.B.Hy^  +  I^s-p-l^^  +  "i^Hi'^e 


(3.15) 


The  purpose  of  this  transformation  is  to  construct  a  vector  which  has 
a  zero  mean  vector.  The  ranks  of  HjBjH'i  and  HjBjH^  are  examined  in  the 
next  lemma. 

Lemma  3.2 


(a)   rank(HiBiH;)  =r-l 
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(b)      rankCHiBjH'i)  =  s-l 


(c)      rank(HiBiH'i-fHiB2Hi)  =r-t-s-2 


Proof: 


(a)      rank(HiBiH'i)       =  r(HiAiA;H'i) 


=  r(HiA,) 

<r(Aj)=r  (from  Lemma  3.1(a)j. 


However, 


^lir  ~t®ls-p-jir  "~  irs-p" 


Thus, 


^iSir  -  Hiirs-p  -  Qrs-p-1 


Therefore, 


rank(HiBiH;)  <r-l 


Conversely,    by   the   Frobenius   Inequality   (see   Lemma  A.l),    we  have 
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rank(HiBiH'i)  =  rank(HiAJ 


>  rank(Hj)  +rank(Aj)  -rank(Ij,s-p) 


p-l-fr-(rs-p)  =r-l. 


Consequently,    rank(HjBjHj)  =  r-  1 . 


(b)      Similar    to    (a) 


(c)      rankCHiBiH'i  +  HiBjH'i)    <  rank(HiBiH'i)  +  rank(HiB2H;) 


=  r  +  s-2        ffrom    (a)    and    (b)j. 


Conversely,  the  Frobenius  Inequality  gives  us 


ran 


k(HiBiH'i+HiB2Hi)  =  rank(Hi(Bi  +  Bj)!!;) 


=  rank(Hi(AiA;+A2A^)H'i) 


ranki 


(Hi[Ai:A2] 


a; 
a; 


h;) 


=  rank(Hi[Ai:A2]) 


>  rank(Hj)  +  rankf  [Aj :  Aj])- rank(Ij,s_p) 
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=  rs-p-l  +  r  +  s-l-(rs-p) 


r  +  s-2 


(by  assumption,  rank  H  Aj :  Aj]!  =  r  +  s  -  1  (see  (3.10)jj.  □ 

The  next  step  in  our  analysis  is  to  simultaneously  diagonalize 
HiBiH'i  and  H^BjH'^.   Because  B,^  and  Bj  do  not,  in  general,  commute, 
neither  will  ^i^i^i'i   and  HjB2Hi .   So  we  cannot  achieve  simultaneous 
diagonal ization  by  way  of  an  orthogonal  transformation.   However, 
both  HiBiH'i  and  H^BjIlj  are  n.n.d.   Consequently  a  nonsingular  matrix, 
M,  of  order  (rs  -  p- 1) x  (rs  -  p- 1) ,  will  exist  that  will  diagonalize 
both  HiBjH'i  and  H1B2H1  (see  Lemma  A. 4).   Define 

Ai^MHiBjH'iM'  (3.16) 

A2  =  MHiB2HiM'  (3.17) 

Both  Aj  and  Aj  are  diagonal  matrices  of  order  rs-p-1.   By  Corollary 
A. 4.1  of  Lemma  A,  we  can  express  A^  and  A2  as 

Ai=dia^I^_l,  O^.i,  0(^_,)(^_i)_p)         (3.18) 


A2 


=  diag(0^_l,  l3_i,  0(^_i)(3_i)_p).        (3.19) 


Consider,  then,  the  (rs-p-1) xl  vector  v  defined  as 
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v  =  Mu.  (3.20) 


It   is   easily   shown   that 


E(v)=0  (3.21) 


2    ,    A    „2     ,-eJl        ,   r-^2 


Var(v)  =  Ai<T^  +  Aj-rJ  -f  F<r^^  +  G<t^ 


where 


F  =  MM' 


G  =  MHiKHiM'. 


(3.22) 


Ve   can   partition   v    into 

Y  =  (Yi,   Y^,    Y^)',  (3.23) 

where  Yi  consists  of  the  r-1  elements  of  y  whose  covariance  matrix  is 
free  from  cr^;  Y2  consists  of  the  s-1  elements  of  v  whose  covariance 
matrix  is  free  from  cr\;    and  V3  consists  of  the  (r-l)(s-l)-p 
elements  of  y  whose  covariance  matrix  is  free  from  both  tr^^  and  ag. 
Hence, 
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e(y)  =Q 


(3.24) 


Var(v)=diag(I^_-L4,  ^s-l^'  0)  +  F^^/J +  G<r^ 


This  result  follows  from  (3.18),  (3.19),  and  (3.21). 

We  are  now  ready  to  develop  our  exact  tests.   To  begin  with,  we 

2 

develop  an  exact  test  concerning  a-^^. 
3.1.3   An  Exact  Test  Concerning  q-q 


Initial  development.   In  this  section  an  exact  test  for  testing 
Ho:(r^  =  0  vs.  E^'-ffl^^O   is  derived  by  utilizing  the  subvectors  v^  and  V3 
of  V  in  (3.23).   To  begin  with,  partition  F  and  G  ^see  (3.22)j  as 


F  = 


and 


11 


f; 


12 


12 


22 


Fis        F23 


13 


23 


33 


(3.25) 


Ml 


G' 


12 


'13 


"12 


'22 


Gn  Gn 


23 


'13 


'23 


'33 


(3.26) 


where  Fii(Gn),  F22(G22) »  and  F33(G33)  are,  respectively,  (r-l)x(r-l), 
(s-l)x(s-l),  and  ((r-l)(s-l)-p)x((r-l)(s-l)-p)  matrices.   Also, 
let  Vj.3  be  the  (s(r-l)-pjxl  vector  defined  as 


-27- 


Yi:3 


^1 
Y3 


(3.27) 


where  Vj  is  (r-l)xl  while  Vg  is  f(r- l(s  -  1)  -  pjx  1.   Then,  from 
(3.21)  we  have 


E(Yi.3)=Q 


(3.28) 


Var(vi.3)=AiC7^  +  Fi.3<7^^  +  Gi.3(r2 


where 


Ai  = 


Ir-1    0 


0     0 


(r-l)(s-l)-p 


(3.29) 


Fi:3 


f'li    f'la 


f^lS     f^33 


(3.30) 


^1:3- 


^11     ^13 


g; 


13     "33 


(3.31) 


From  (3.22)  notice  that  since  both  F  and  G  are  positive  definite 
(p.d.),  Fj.g  and  G1.3  are  also  both  p.d. 
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Now,  there  exists  a  nonsingular  matrix  N  or  order  s(r-l)-p  such 
that  (see  Lemma  A. 3) 


NF,:3N'  =  l3(,_i). 


(3.32) 


NAiN'  =  Aj, 


(3.33) 


where  A;^  is  a  diagonal  matrix  whose  diagonal  entries  are  solutions  G 
to 

|A,-eF,,3|  =0.  (3.34) 


:'-l 


Ve  note  that  if  Fj-lg  represents  the  first  r-1  rows  and  columns  of  F^.g 
then  the  nonzero  solutions  to  (3.34)  are  exactly  the  eigenvalues  of 


7II 
1:3- 


Without  loss  of  generality  we  may  express  Aj  in  (3.33)  as 


k\      0 
0   0 


(3.35) 


where  A*  is  a  diagonal  matrix  of  order  r-1  whose  diagonal  entries  are 
the  eigenvalues  of  FJ^.g.   (If  Aj  is  not  in  this  form  then  we  can  find 
an  orthogonal  matrix  F  such  that 


PAjP' 


A*   0 
0   0 


Thus,  letting 


we  see  that 
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N*  =  PN 


N*AiN*'  =  PNA^N'P' 


FAi?'    (see  (3.33)) 


A*   0 
0   0 


•) 


The  purpose  of  introducing  the  matrix  N  was  to  simultaneously 
diagonalize  A^  and  F-^.^   in  the  expression  for  Var(vi.3)  given  in 
(3.28). 

Define  the  s(r-l)-p  random  vector  x^.g  as 


?i:3  — 


- 

~ 

"<! 

=  N 

Yl 

^3 

Y3 

_ 

_ 

=  Nvi,3 


(3.36) 


where  x^  is  (r-l)xl  and  X3  is  ((r  -  1)  (s  -  1)  -  pjx  1 .   Then 


E(xi.3)=0 


Var(xi.3)  =Ai(7^  +  Ig(.^_-^)_ptr^^  +  Li(r2. 


(3.37) 
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This  follows  from  (3.28),  (3.32),  and  (3.33).   L^  is  the  (s(r-l)-p) 
x(s(r-l)-p]  full  rank  matrix 


Li  =  NGi.3N'.  (3.38) 

Ve  can  equivalently  express  Var(xj.3)  in  (3.37)  as 

Var(x,,3)=diag(Ar<r2^  +  I^_l<T2^^,  I(r_i)(s-l)-p'^a/?)+ ^1^^?-   (3-39) 

Because  Lj  is  not  diagonal,  Xj  and  X3  are  not  independent.   To  achieve 
independence  between  Xj  and  X3,  we  resample  from  the  error  vector. 
The  results  of  Lemma  B.6  are  utilized  heavily  in  the  forthcoming 
development. 

Utilizing  resampling.   Because  model  (3.1)  or  (3.3)  is  a  special 
case  of  model  (B.l)  the  error  sum  of  squares,  SSE,  associated  with 
this  model  can  be  expressed  as 

SSE  =  y'Ry  (3.40) 

where  R  is  the  nxn  symmetric,  idempotent  matrix 

R=Ijj-W(V'V)-W'  (3.41) 


V-[1^:X,:X2:X3]  (3.42) 


-al- 


and Xj,  Xj,  and  X3  are  described  in  (3.3).   Since 

range(ln)  C rangeCXg) 
range(X^)  Crange(X3)     (i  =  l,2), 

(3.41)  can  also  be  written  as 

R-=  In  "^^3(^3^3)   ^^3 


=  In-    ®    (Jn-./niJ.  (3-43) 


This  last  equality  follows  since 


X3=     e    In..-  (3-44) 

(i,j)€r  iJ 


Following  Lemma  B.6  (iv),  there  exists  an  orthogonal  matrix  C  and  a 
diagonal  matrix  T   such  that 

R^CrC.  (3.45) 

Partition  C  and  T   as 

C  =  [Cj :C2:C3] 

(3.46) 
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r  =  diag(I^^,I^^,0) 


where 


7?j  =  s(r-l)-p 


7J2  =  n  -2(rs  -  p)  -f  s, 


(3.47) 


We  note  that  r^j  +  7?2  =  n  -  (rs -p)  =  rank(R)  and  r/2  >  0  by  (3.10)(v). 
Ci,C2,  and  C3  are  matrices  or  orders  nxrj^,    nxr]^   and  nx(rs-p), 
respectively.   From  (3.46),  R  can  be  written  as 

R  =  CiC;  +  C2C^.  (3.48) 

Now,  consider  the  fs(r-l)-pjxl  random  vector  lj^.^   defined  to  be 


-l:3  =  >^i:3+(\,maxI,?i-Li)^Ciy  (3-49) 

1 
where  \   ^^x  is  the  largest  eigenvalue  of  L^  and  (-^i  ,inaxl7jj  "'^i)^  ^^  ^ 

symmetric  matrix  with  eigenvalues  equal  to  the  square  roots  of  the 

eigenvalues  of  (Aj^p^ax^r?  -^j),  which  are  nonnegative. 

Let  Wj.3  be  partitioned  just  like  Xj.3  in  (3.36),  that  is 


'i'i:3  -I  Wt 


^,.,=1^-1  (3-50) 


where  Wj  and  W3  are  of  orders  r-1  and  (r  -  l)(s  - 1)  -  p,  respectively. 
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The  major  result  of  this  section  is  given  in  the  following 
theorem. 

Theorem  3.1 

(a)  E(yi)=0;  £(0-3)  =  0 

(b)  Wj  and  W3  are  independent,  normally  distributed  random  vectors 
and  have  the  following  variance-covariance  matrices: 


Var(wi)=A*<T^  +  5iIr_l 


Var(y3)=^I(r-l)(s-l)-p 


(3.51) 


where  ^1  =  o-^yj  +  Aj  ^  ^ax  <^l- 


Proof: 


(a)   From  (3.49)  we  have 


E(yi:3)=E(>Si:3)  +  (^i,maxl7?i-Li)'C'iE(y) 

From  (3.37)  we  know  that  E(xi.3)=0.   Since  Rln=0  ^see  Lemma 
B.6(ii))  and  range(Ci)  C  range(R)  ,  we  have  that  C[ln  =  Q.      Thus  the 
ult  follows  after  noting  that  E(y)  = /iln  ^see  (3.3)j. 


res 
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(b)   Clearly  Wj  and  u^   are  normally  distributed.   From  Lemma  B.6(v)  we 
have  that  y  is  independent  of  SSE.   Consequently 

DQR:=0 

where  D  is  such  that  Dy  =  y,  and 


fi  =  Var  (y  )  =  \^\[al  +  \^\'^a''p  +  XjX^^^^  +  a^I^       (3 .  52) 


(see  (3.3)].   However,  since  range(Ci)  C range(R)  it  follows  that 

DfiCi  =  0 . 

Thus  y  is  independent  of  Cjy.   Since  Xj.a  is  a  function  of  y,  it 
must  also  be  independent  of  C^y.   Hence 

Var(y,:3)=Var(x,;3)  +  (A,^„3,^I^^-L0^CifiCi(Ai,^^^I^^-Li)2.    (3.53) 

From  Lemma  B.6(ii),  RfiR  =  RV^ .   Thus  since  range ( C^ )  C  range (R) 
and  R  is  idempotent  we  have 


Cir2Ci  =  C\C^a^  =  (T^l^^. 


The  last  equality  follows  from  the  fact  that  C=[C2:C2:C3]  is 
orthogonal.   Therefore,  (3.53)  can  be  expressed  as 
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Var(yi.3)  =  Var(xi  .3)  +  (Aj  ^n,axl/?i  "  Li)'^£ 

Consequently,  from  (3.35), 

Var(y,)  =  A*<r^  +  5il^_i 


Var(y3)  =^il(^_i)(3_i)_p 


where  Si-(^ap  +  \,ma,x<^l- 


As  a  result  of  Theorem  3.1  we  obtain 


D 


(i)   SSi=yi(A*4  +  5iI^_i)-Vi~x'r_l  (3-54) 

(ii)   SS3  =  y^W3/5i~x'(^_i)(3_i)_p  (3.55) 
(iii)  SSj  and  SS3  are  independent. 

A  test  statistic  for  testing  Hq:o'^  =  0  vs.  Hg^:<r£^^0  is  then 


F,= pllllzll ^.  (3.56) 

y^y3/((r-l)(s-l)-pj 
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Under  Hq,  ^i  ~  F^_i , (r-l)(s-l)-p-   ^°^^  ^^^^ 


<|^)  =  V^  +  ^ 


(3.57) 


where  Ij  is  the  average  of  the  eigenvalues  of  F\\j  (see    (3.35)j.   Since 
F1.3  (see  (3.30)]  is  p.d.  it  follows  that  Fj"}3  is  p.d.   Thus  Pj^.g  is 
p.d.,  implying  that  ^^>0.   Consequently,  we  would  reject  Hq  at  the  7- 
level  of  significance  if 

f^l>^,r-l,(r-l)(s-l)-p 


where  F     ^  /   -,  x  /  -,  s         is  the  upper  1007%  point  of  the 
7,r-l,(r-l)(s-l)-p 

corresponding  F-distribution. 

Ve  now  turn  our  attention  to  the  construction  of  an  exact  test 


concerning  cr 


/?• 


3.1.4   An  Exact  Test  Concerning  a 


P 


Initial  development.   This  section  is  concerned  with  the 
construction  of  an  exact  test  for  o-^.   Much  of  the  development  in 
this  section  parallels  that  of  Section  3.1.4  except  that  we  now  focus 
on  the  subvectors  Vj  and  Vg  of  v  in  (3.23). 

Let  V2.3  be  the  (r(s-l)-pjxl  random  vector 


Y2:3  = 


(3.58) 
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where  Vj  is  (s-l)xl  and  V3  (recall)  is  f (r-l)(s-l)-pjx  1 .   Similarly  to 
Section  3.1.3  we  can  make  a  nonsingular  transformation,  T,  to  obtain 
a  random  vector  X2.3=[x2:x3  ]  =Tv2.3  with 


E(X2:3)=Q 


Var(x2:3)  =  ^2^^  +  Ir(s-l)-p'^a/?  + Ve 


(3.59) 


where  A2  =  diag(A2,  0)  and  Aj  is  a  diagonal  matrix  of  order  s-1  whose 
diagonal  entries  are  the  eigenvalues  of  FW^.      F^^   represents  the 
first  s-1  rows  and  columns  of  F2.3  where 


2:3 


^^22    ^23 


^23    ^33 


(3.60) 


[see   (3.25)).      Also,    Lj   is   the  rr(s-l)-pjxfr(s-l)-pj  full   rank  matrix 


L2  -TGjjsT  . 


(3.61) 


T  is  the  nonsingular  matrix  that  simultaneously  diagonalizes  F2.3  and 


A2  where 


^s-1 
0    0 


(r-l)(s-l)-p 


(3.62) 


(see(3.24));  and  62:3  is  defined  to  be 
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^2:3 


'22 


g; 


23 


'22 


'23 


(3.63) 


(see  (3.26)). 

Ve  note  that  x| ,  defined  in  Xjjs  above,  is  not  the  same  as  X3  in 
(3.36)  since  a  different  nonsingular  transformation  was  necessary  to 
simultaneously  diagonalize  F2.3  and  Aj. 

Because  Xj  and  X3  are  not  independent  (Lj  is  not  diagonal  (see 
(3.39))),  we  will  again  need  to  employ  resampling  to  aid  in  the 
construction  of  the  exact  test. 

Utilizing  resampling.  Consider  again  the  matrix  R  defined  in 
(3.41)  or  (3.43).  Unlike  the  partitioning  of  C  and  T  described  in 
(3.46),  we  now  partition  C  and  F  as 


C  =  [Cr:C*:C3*] 


r  =  diag(I  .,1  *,0) 

VI   '/2 


(3.64) 


where 


r;*  =  r(s-l)-p 


7^2  =  n-2(rs-p)  -\-  r . 


(3.65) 


Note  that  7?* -f  77*  =  n-(rs-p)  =  rank(R)  and  7;*  >  0  by  (3.10)(v).   q,  CJ, 
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and  C3  are  matrices  of  orders  nxr]*,    nXT]2,    and  nx(rs-p), 
respectively.   From  (3.64),  we  can  express  R  as 


*n*t   I  o*n*i 


—  \j-t  \j-i    ~T  v>2  ^2 


(3.66) 


(see  also  (3.45)1. 

Now  define  the  fr(s-l)-p)xl  random  vector  w 


2:3 


^2:3-^2:3+  (•^2,maxln*~^)^^l  t 


(3.67) 


wh 


is  the  largest  eigenvalue  of  Lj  and  (Aj  max^  *~^2)^    ^^   ^ 


2 , max  — <=  '     "=■  .i      ^  ^,iua.^  jj_- 

symmetric  matrix  with  eigenvalues  equal  to  the  square  roots  of  the 
eigenvalues  of  (A2  max^  +"^2),  which  are  nonnegative. 
Let  W2.3  be  partitioned  just  like  X2;3,  that  is 


'i'2:3 


-3* 


(3.68) 


where  Wj  and  y|  are  of  orders  s-1  and  (r-l)(s-l)-p,  respectively. 
Theorem  3.2  represents  the  major  result  of  this  section. 

Theorem  3.2 

(a)   E(y2)=Q;  E(y*)  =  0 


(b)   ^2  and  W3  are  independent,  normally  distributed  random  vectors 
and  have  the  following  variance-covariance  matrices: 
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Var(w2)  =  A2<T^  +  «52ls_i 


Var(y|)=^2l(r-l)(s-l)-p 


(3.69) 


where  i5,  =  (t^^ -t- A,  m;,^^. 


'2 -"a/?  "^''2; 


max" e ■ 


Proof: 


The  proof  of  Theorem  3.2  is  very  similar  to  the  proof  of 
Theorem  3.1  and  is  thus  omitted.  ^ 

As  a  result  of  Theorem  3.2  we  obtain 

(i)   SS2  =  yi(A*4-h62ls-l)"V2~x|_i  (3-70) 

(ii)   SS3*  =  y*'y3V52~x|r_i)(3.i)_p  (3-71) 

(iii)  SSS2  and  SS|  are  independent. 

A  test  statistic  for  testing  Ho:(7^  =  0  vs.  H^io-^t^O  is  then 


F2  = 


^2^2/(^-1)      _  (3.72) 

y*'y3V((r-l)(s-l)-p)  ' 


Under  Hq,  ^2 ~  ^s-l , (r-l)(s-l)-p-   ^^^^  *^^* 

E{^)  =  -e,<ryS,  (3.73) 

where  ?2  is  the  average  of  the  eigenvalues  of  F2^.3.   Since  F2.3  {see 
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(3.60))  is  p.d.  it  follows  that  Fj^g  is  p.d.   Thus  F^^   is  p.d.  , 
implying  that  l2>0-   Consequently  we  would  reject  Hq  at  the  7-level 

of  significance  if  ^2 >  F^,s-1 , (r-l)(s-l)-p- 

A  test  for  the  variance  component  associated  with  the 
interaction  effect  is  developed  in  the  next  section. 


3.1.5   An  Exact  Test  Concerning  a^g 


In  this  section  we  develop  an  exact  test  for  a'^^o   by  using  V3  ^see 
(3.23)).   Ve  note  at  the  onset  that  this  test  is  equivalent  to 
Thomsen's  (1975)  test. 

From  (3.24) -(3.26)  we  have 

E(y3)=o 

(3.74) 
Var(v3)=F33tr^^  + 633^2, 

Since  v  is  a  function  of  y  (see   (3.14)  and  (3.20))  and  y  is 
independent  of  SSE  fLemma  B.6(v)),  v  is  also  independent  of  SSE.   Thus 
V3  is  independent  of  SSE  as  it  is  a  subvector  of  v.   Also,  it  clearly 
follows  from  (3.74)  that 

Y^(F33<^a/?  +  G33^?)"'Y3~x'(r-l)(s-l)-p-  ^^'^^^ 


Because  SSE/c^  ~  x^  _ /   _  n  a  "test  statistic  for  testing  Hq:(t^^  =  0 
Ha  =  '^a/?^0  is 
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.;<=^S/((^-l)(s-l)-p)^  (3.T6) 

SSE/(n-(rs-p)j 


Under  Hq,  ^3 ~  F(r-l)(s-l)-p,n-(rs-p) ' 


E 


Y3G33Y3 


(r-l)(s-l)-p 


Note  that 


=  Va/?  +  '^e  (3-77) 


where  ^3  is  the  average  of  the  eigenvalues  of  G33F33.  Since  both  GJ3 
and  F33  are  p.d.  ^3  >  0  (see  Graybill(1983,  p.  397)j.  Hence  we  would 
reject  Hq  for  large  values  of  F3.  In  this  case,  we  did  not  need  to 
resample  to  construct  the  exact  test. 

3.1.6   A  Numerical  Example 

An  automobile  manufacturer  wished  to  study  the  effects  of 
differences  between  drivers  and  differences  between  cars  on  gasoline 
consumption.   Four  drivers  were  selected  at  random;  also  five  cars  of 
the  same  model  with  manual  transmission  were  randomly  selected  from 
the  assembly  line.   Each  driver  drove  each  car  a  maximum  of  three 
times  over  a  40-mile  test  course  and  the  miles  per  gallon  were 
recorded.   For  technical  reasons  the  drivers  did  not  drive  the  cars 
the  same  number  of  times.   Unfortunately,  unforeseen  mechanical 
problems  forced  the  cancellation  of  some  of  the  test  drives.   The 
data  is  recorded  in  Table  3.1 

We  first  test  for  a  significant  interaction  effect  using  (3.76). 
We  find  that 
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F3  =  0.7167 
numerator  d.f.  =9 
denominator  d.f.  =22 
p-value  =  0.6885. 

Thus  there  does  not  appear  to  be  a  significant  interaction  effect. 
The  results  concerning  the  main  effects  variance  components  are  now 
presented. 

For  testing  Kq:(tI^  =  0,    regarding  the  driver  variance  component,  we 
use  (3.56).   For  this  data  we  find  that 

Fi  =261.1971 
numerator  d.f.  =3 
denominator  d.f.  =9 
p-value  < 0.0001 

The  test  for  the  car  variance  component  resulted  in  (see  (3.72)j 

F2  =  71.2372 
numerator  d.f.  =4 
denominator  d.f.  =9 
p-value <  0.0001 

All  calculations  were  performed  using  S-plus. 
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Table  3.1    An  example  of  an  unbalanced  random  two-way  model  with 
missing  cells 


A(driver) 


27.7      30.7 
28.5      30.4 


B(car) 


25.3      28.1      24.8      28.4 

25.2      28.9      25.1      27.9 

30.0  26.7 


33.6 

36.7 

31.7 

35.6 

33.7 

32.9 

36.5 

31.9 

35.0 

33.9 

31.8 

32.9 

(source:   Dr.  Andre  I.  Khuri) 


29.7 

29.2 

30.2 

28.9 

31.5 

29.2  32.4  31.8      30.3 

29.3  32.4  30.7      29.9 
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3.1.7   The  Power  of  the  Exact  Tests 

The  approximate  power  of  the  exact  test  concerning  o'q  is  derived 
in  this  section.   Similar  expressions  can  also  be  developed  for  the 
test  for  (t'q   and  the  test  for  a'^^g.      Our  development  is  based  on  the 
results  in  Hirotsu  (1979,  pp.  578-579).   The  reader  is  referred  to 
Appendix  D. 

Let  "^j  denote  the  power  of  the  test  for  cr^.   Then  from  (3.56) 


l~   y^y3/[(r-l)(s-l)-p]-  7  ,r-l ,  (r-1 )  (s-1 ) -p  ' 


(3.78) 


where  Haio-^T^O.   Under  H^,  u'■^uJ-^    is  distributed  like  a  random  variable 
of  the  form  (see   Box  ( 1954b) j 


"^-^  i  +  _2  ,  ,   ^„  (3.79) 


j=l  ^  -^ 


where  the  x-'s  are  independent  chi-squared  variates  with  one  degree 
of  freedom,  and  A*-  is  the  j*'*  diagonal  element  of  A*  ( j  =  1 ,2,  •  •  •  ,r-l) ; 
A*  is  defined  in  (3.35)  and  6-^    in  (3.51).   The  exact  distribution  of  T 
is  complicated.   To  obtain  approximate  power  values  let 


3   (r-l)(s-l)-p 


V[V^M  (3.81) 
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6i(r-l) 
h-       cf      S,r-l,(r-l)(s-l)-p 


(3.82) 


where    in   this   case 


c  =  6 


1   r-1      ^ 

E(AV  +  i) 


(3.83) 


f  = 


e\aV  +  i) 


_2 


e\a*.^  +  i)^ 


(3.84) 


1       <^a/?  +  '^l,max'^£ 


(3.85) 


Then 


*l  =  P(H>h|Ha)=P(F^  .f    >h) 


+  [p/{3(f  +  2)(f  +  4)B(if,if2)} 


<^-fr^'''^\# 


r^^o^r^^A,     2(f  +  f,)(f  +  4)      (f +  f, +  2)(f +  f,)' 
(f  +  2)(f  +  4)-      ,^,^/(,h)      +      {1  +  V(fh)}'^ 


(3.86) 


where 


f2=(r-l)(s-l)-p 


(3.87) 
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r-1   * 


-|2 


(3.88) 


and  B(mi,m2)  is  the  beta  function.   Ve  note  that  from  (3.86)  the  power 
of  the  exact  test  depends  on  the  design  parameters  •^i  ,max'^l '  '  " '^r-1 ' 
the  level  of  significance;  and  the  ratios  crQ/<T^  and  o-(^p/<^e   °^  ^^^ 
variance  components. 

3.1.8    Concluding  Remarks 


The  key  to  the  construction  of  the  exact  tests  for  the  main 
effects'  variance  components  were  the  random  vectors  w^.g  and  W2.3  in 
(3.49)  and  (3.67),  respectively.   Both  of  these  vectors  represent  a 
linear  combination  between  a  function  of  y  and  a  portion  of  the  error 
vector. 

Ve  note  that  Cj  in  (3.49)  and  C*    in  (3.67)  are  not  unique.   Thus 
the  values  of  the  test  statistics  Fj  and  Fj  (see  (3.56)  and  (3.72), 
respectively)  will  depend  on  the  choice  for  C^  and  C*,  respectively. 
However,  the  distributions  of  Wj.3  and  Wj.g  are  invariant  to  these 
choices.   It  is  in  this  sense  that  our  methodology  is  similar  to 
techniques  involving  resampling. 

Ve  also  presented  an  exact  test  for  the  variance  component 
associated  with  the  interaction  effect.   This  test  is  similar  to 
Thomsen's  (1975)  procedure.   Seely  and  El-Bassiouni  (1983)  point  out 
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that  Thomsen's  (1975)  test  for  a^^  is  a  Wald's  test.   Since  our  test 
is  based  on  the  same  degrees  of  freedom  as  Thomsen's  test,  we 


c 


onclude  that  our  test  for  a^^  is  also  a  Wald's  test.   Consequently, 


F3  in  (3.76)  can  be  equivalently  expressed  as 

R((a/?)lp,g,/?)/[(r-l)(s-l)-p] 
F  -  '^  ~ ~J- (3.89) 

^  SSE/(n-(rs-p)j 

where  R((a^)  |^,g,/?)  represents  the  sum  of  squares  for  (a^)  adjusted 
for  /i,  g,  and  /?.   Here,  g  and  /?  are  treated  as  if  they  were  fixed 
effects  (see  Seely  and  El-Bassiouni  (1983,  p.  198)j. 

Consider  again  Wj.3  defined  in  (3.49).   It  is  easy  to  verify  that 
Wj^.g  can  be  written  as 


where 


1 
yi.3  =  A^g  +  e*  (3.90) 


4=(f  si  (3-91) 


(see  (3.35)V  and 

a-m,  Ms(r-i)-p)-  (3-^2) 

<5i  is  defined  in  (3.51).   Now,  we  have 


1 

2 

rank(Aj)  =  r-1 
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and  (using  similar  notation  to  that  of  Seely  and  El-Bassiouni ) , 


fy  =  s(r-l)-p-(r-l)  =  (r-l)(s-l)-p 


kv  =  r-l, 


Thus,  the  test  statistic  F-^  is  a  Vald's  test  based  on  the  Wj.g  model. 
Likewise,  we  can  also  show  that  Fj  is  a  Vald's  test  based  on  the  lj^-.S 
model . 

3.2  The  Unbalanced  3-Fold  Nested  Random  Model 
3.2.1   Introduction 

The  unbalanced  3-fold  nested  random  model  can  be  expressed  as 

Yijk£  =  /^  +  «i  +  ^ij  +  ^ijk  +  ^ijk«  (3.93) 

(i  =  l,2,...,a;  j  =  l,2,-  •  •,b.;  k  =  1 ,2,  •  •  ■  ,c.  j ;  £  =  1 ,2,  •  •  •  ,n.  j^)  , 
where  /J  is  an  unknown  constant  parameter;  a^,    P^y    '''ijk  ^^^  random- 
effects  associated  with  the  first-stage,  second-stage  and  third-stage 
nested  factors,  respectively;  and  f ^  ;]^g  is  the  random  error  component. 
Ve  assume  that  a^ ,  P^-,    7ijk»  ^•"^  ^ijk«  ^^^   independently  distributed 
normal  random  variables  with  zero  means  and  variances  cr^,  <Ta,    o-^,  and 
(T^,  respectively.   Ve  assume  there  are  no  missing  cells.   If  n  is  the 
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total  number  of  observations  and  c  and  b  the  total  number  of  levels 
of  the  third-  and  second-stage  nested  factors,  respectively,  then 


"=?/ijk 


c=Ecii  (3.94) 


b=Ebi 
i 


9      1    "2 

We  are  concerned  with  constructing  exact  tests  for  cr^  and  (To. 
We  again  need  to  use  resampling  to  achieve  our  goal.   The  results 
given  in  Seely  and  El-Bassiouni  (1983)  will  allow  us  to  conclude  that 
our  tests  are  Wald's  tests.   Ve  note  that  an  exact  test  for  o-^  is 
known  to  exist  based  on  comparing  R(7|/j,a,/?)  to  SSE  where  R(7|^,a,/?) 
is  the  sum  of  squares  for  7  adjusted  for  /j,a,  and  /?  (treating  a,/?  and 
7  as  fixed  effects);  and  SSE  is  the  error  sum  of  squares.   That  is,  to 
test  Ho:(T?,  =  0  vs.  Ha,:<T^7^0  we  would  reject  Hq  for  large  values  of  F^ 
where 


R(7|/i,cr,/3)/(c-b) 
7-    SSE/(n-c) 


Under  Hq,  F^  ~  F^_^^  ^_^   (see,  e.g.,  Searle  (1971)j.   Resampling  was  not 
needed  to  develop  this  test. 

Now,  in  matrix  notation,  model  (3.93)  becomes 

y  =  plji+Xig  +  X2^  +  X37  +  e  (3.95) 
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where  y  is  the  nxl  vector  of  observations;  1^    is  a  vector  of  ones  of 
dimension  n;  Xj ,  Xj,  and  X3  are  known  matrices  of  one's  and  zero's  (see 
(3.97)  -  (3.99)];  a,    j3 ,    and  7  are  random  vectors  whose  elements  are 
the  a-,    3-   -,  and  7-  -i,,  respectively;  and  e    is  the  nxl  vector  of 
errors.   If  n-  and  n-  •  are  defined  as 


j,k  '^"^ 


(3.96) 


nij=Enijk   (i  =  lj2,  •  •  •  ,a;  j  =  1 ,2,  •  •  •  ,b^) 


then, 


Xi=eln.  (3.97) 

i=l   1 


a  ^i 

X2=.®   ®  In.  .  (3.98) 


i=l j=l    ij 


a  W  ^ij 
i  =  l  1=1  k=l    ijk 


X3=_e  .©   ®  in.  .,  (3.99) 


From  (3.95)  we  have  that 


y~N(pln,  fi) 


wh( 


fi=(©Jn.)4+(.®.Jn.  .)4+(.  ®  Jn,,,)'^7  +  In'^'-     (3.100) 
1   1      i,j   ij  ^^       i,J,k   iJK 
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Now,    define  y^  ::j^  as 


^ijk=%M'-ijk^  ^'-'''^ 


(i  =  l,2,---,a;    j  =  l,2,..-,b.;    k  =  1 ,2,  ■  ■  •  ,c.  j)  .      Then   from    (3.93)    we 
have 

yijk-^  +  "i+^ij  +  T'ijk+^ijk  (3.102) 

(i  =  1,2,- •  •  ,a;    j  =  l,2,  ■  •  •  ,bj;    k  =  1 ,2,  •  •  •  ,c.  j)  ,    where 

^       "ijk 

In  matrix  form,  (3.102)  becomes 

y  =  ;/lc  +  Aig  +  A2^+Ic7+I  (3.103) 


where  y    is    the   cxl   vector  of   means;    €    is   the   cxl   vector 

1  =  (Tjjj,Tjj2,  •  •  •  jf^u   „        )';    and  A^   and  Aj  are   defined  as 

^  abg, 


Ai=  I  Ic.  (3.104) 

i  =  l      1 


a      ^i 
A2=l      ©  Ic  (3.105) 

i^lj^r'^ij 
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where 


Ci  =  Ecn        (i  =  l,2,---,a).  (3.106) 

J 


Ve  note   that 


range(lc) C  range(Ai)  C  rangeCAj). 


In  fact, 


lc  =  Aiia  =  A2lb 


Ai  =  A2(l  lb.) 
1=1      1 


From   (3.103)   we  have 


y~N(/ilc,    S) 


where 


and 


(3.107) 


^  =  Va  +  V^+Ic'^7  +  Kc7£  (3.108) 


Ai=AiAi=  I  Jc.  (3.109) 

i  =  l      1 
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A.^A.A'^I   ©  Jc.  .  (3.110) 

'   '    i=l  j=l   iJ 


b.  C-. 
K=|   ®   e  nTl  .  (3.111) 

i  =  l  j=lk=l  ^J^ 


Besides  the  assumptions  concerning  the  random  effects,  we  will  also 


assume: 


(i)  The  model    used   is  model    (3.93)   (or  model    (3.95)) 

(ii)  b-a>0  (b   is   defined    in    (3.94))  (3.112) 

(iii)  n>2c-l  In  and   c  are  defined    in    (3.94)J 

(iv)  c>2b-l. 

The  last  two  assumptions  (along  with  the  others)  are  needed  to  insure 
the  validity  of  our  procedures.   Neither  is  overly  restrictive. 
Assumption  (iii)  can  be  satisfied  if,  for  example,  n^  •j^>2  for  all 
(i,j,k)  while  Assumption  (iv)  will  hold  if,  for  example,  c^j>2  for 
all  (i,j). 

3.2.2.   Preliminary  Development 

In  this  section  we  use  the  last  c-1  rows  of  the  cxc  Helmert 
matrix  (see  Searle  (1982),  p.  7lj  to  transform  model  (3.103)  into  one 
that  has  a  zero  mean  vector.   We  also  present  some  results  concerning 
the  ranks  of  the  model  matrices  in  the  transformed  model. 
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Let  Hj  be  the  (c-l)xc  matrix  resulting  from  the  deletion  of  the 
first  row  of  the  cxc  Helmert  matrix.   Hj  has  orthonormal  rows  and  is 
such  that  Hilc  =  Q^_j^.   Define  the  (c-l)xl  vector  u  as 

u=Hiy.  (3.113) 


Then, 


E(u)=HiE(y)=/iHilc  =  Q^_l 


Var(u)  =HiAiH'i(r^  +  HiA2H'i(T^  +  I^_i(r2 +La2         (3.114) 


where 


L  =  HiKH;.  (3.115) 


Recalling  (3.104)  and  (3.105),  it  is  easily  seen  that 


rank(Aj^)  =  a 


(3.116) 


rank(A2)  =  b. 
The  next  lemma  provides  information  on  the  ranks  of  HjA^H^  and  H^AjH^ . 
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Lemma  3 . 3 

(a)  rank(HiAiH'i)  =a-l 

(b)  rankCHiAjH'i)  =b-l 

(c)  rankCHjAiHi+HiAjH;)  =b-l. 
Proof : 

(a)      rank(HiAiH'i)  =  rank(HiAiA;H'i) 

=  rank(HjAj) 

<rank(Ai)=a.  (see    (3.116)) 

However,  from  (3.107),  l^  =  k^lg^.      Since  H^lj- =  O^j  it  follows 

that  the  a  columns  of  H^^Aj  are  linearly  dependent.  Thus 

rank(HiAi)  <a-l  =>  rank(HiAiH'^)  <  a- 1 .   Conversely,  by  the 
Frobenius  Inequality  (see  Lemma  A.l),  we  have 

rank(HiAiHi)  =rank(HiAi) 

>  rank(Hj)  +rank(Aj)  —  rank(I(-) 
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=  c  —  1+a  —  c  =  a  —  1. 
Thus,    rank(HiAiHi)  =a-l. 

(b)  Similar   to    (a). 

(c)  Since   range(AJ  C  range(A2)    we   have    that   range(HiAi)  C  range(HiA2) 
Consequently, 

range    {]iiK^A[Yi[)  =  range(HiAi)  C  range(HiA2)  =  rangeCHiAjA^H'i)  . 
Thus   there   exists  a  matrix  B   such   that 

HiAih;  =  (HiA2h;)b. 

(Since  both   HjAiHi   and  HiAjH^   are  n.n.d.      B  must  be   n.n.d.) 
Hence, 


ran 


kCHjAiHi  +  HiAjHi)  =rank[(HiA2H'i)B  +  HiA2Hi] 


=  rank[HiA2Hi(B-hI)] 

=  rank(HiA2H'i)  (  B+l  has  full  rank) 

=  b-l  (see    (b)).  D 


i 


-58- 


3.2.3   An  Exact  Test  Concerning  (P' 


/? 


Introduction.   In  this  section  we  construct  an  exact  test  for 
the  variance  component  associated  with  the  second-stage  nested 
factor.   The  idea  is  to  first  make  a  transformation  of  model  (3.113) 
based  on  resampling  from  the  error  vector.   This  transformation  will 
result  in  a  model  that  is  a  special  case  of  the  model  described  in 
Seely  and  El-Bassiouni  (1983,  p.  198).   We  will  then  apply  the 
results  in  Seely  and  El-Bassiouni 's  Equation  3.2  to  develop  the  exact 
procedure. 

Utilizing  resampling.   Recall  model  (3.95).   Let  V  be  the 
partitioned  matrix 

V=[ln:Xi:X2:X3]  (3.117) 

and  let  R  be  the  matrix 

R=I-W(W'V)-V'.  (3.118) 

Since 

range(ljj)  C  range(Xi)  C  range(X2)  C  range(X3)  , 
(3.118)  can  be  expressed  as 

R  =  I„-X3(X^X3)-iX^.  (3.119) 
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The  error  sum  of  squares  associated  with  model  (3.95)  is  given  by 

SSE  =  y'Ry.  (3.120) 

It  is  known  that  R  is  symmetric,  idempotent,  and  has  rank  n-c. 
Furthermore,  SSE/o-^  has  the  chi -squared  distribution  with  n-c  degrees 
of  freedom  independently  of  y  (see,  e.g..  Lemma  B.6).   We  can  express 

R  as 

R  =  CrC'  (3.121) 

where  C  is  an  orthogonal  matrix  and  T  is  a  diagonal  matrix  whose 
first  n-c  diagonal  elements  are  equal  to  unity  and  the  remaining  c 
entries  are  equal  to  zero.   By  assumption  (3. 112) ( iii ) ,  we  can 
partition  C  and  T   as 

C  =  [0^:02:03] 

(3.122) 
r  =  diag(I^^,I^^,0) 

where 

(3.123) 
T}2  =  n-2c+l  >  0 
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and  Cj,  C2,  and  C3  are  matrices  of  orders  n  x  t/j  ,  nxrjj*  and  nxc, 
respectively.   Note  that  tj^ +  7/2  =  n  -  c  =  rank(R)  . 

From  these  partitionings  and  the  fact  that  (recall  C  is 
orthogonal) 


C'.C.  =1     (i  =  1,2,3) 


C'iC.  =  0     (i#j), 


(3.124) 


we  obtain 


R  =  CiCi-|-C2Ci.  (3.125) 


Now,  define  the  (c-l)xl  random  vector  y  as 


y  =  y  +  (\axi,,i-L)^y  (3-i26) 


where  A^^^^  ^^  the  largest  eigenvalue  of  L  fsee  (3.115)1  and  C'^  is  the 

nxr;i  matrix  defined  in  (3.122).   The  matrix  (Amax^r?,  ~  ^)  ^^  p.s.d. 

1 
Hence  (A^jax^T?  ~L)2  is  well-defined  with  eigenvalues  equal  to  the 

square  roots  of  the  eigenvalues  of  i^jaa,x^ri   ~'^)»  which  are 

nonnegative.   The  first  major  result  of  this  section  is  given  in  the 

next  theorem  which  provides  some  distributional  properties  for  lj. 
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Theorem  3.3 


The  random  vector  w  is  normally  distributed  with 


E(y)-Q^_i 


(3.127) 


Var(w)  =  HjAiHia^  +  HiKiK'^jJ  +  ^^c-l 


9  1 

where  ^  =  ""-y  + -^max^^c  ■ 


Proof : 


The  fact  that  u   is  normally  distributed  is  clear.   Ve  know  that 
E(u)=0  (see  (3.114)).   From  Lemma  B.6  (ii)  we  have  that  Rln  =  Q. 
Since  range(Ci)  C range(R) ,  it  follows  that  Cilu  = 0  (recall  that  R  is 
symmetric) .   Hence 


E(y)=E(u)  +  (A„a^I^^-L)2CiE(y) 


=  Q  +  (^maxl7,i-L)^C;(ln)/i 


=  0. 


Now,  we  claim  that  u  and  C^y  are  independent  in  the  expression  given 
for  w  in  (3.126).  To  verify  this  recall  that  SSE  is  independent  of  y 
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(see  Lemma  (B.6)(v)j.   Consequently, 


Df}R  =  0 


where  D  is  such  that  Dy  =  y,  and  Q  is  defined  in  (3.100).   Since 
range(Ci)  C range(R)  it  follows  that 


DQCi  =  0 . 


Thus  y  is  independent  of  C'^y.   Since  u  is  a  function  of  y,  u  is  also 
independent  of  C^y  as  claimed.   Hence, 


Var(y)=Var(u)  +  (A^a^I^^-L)^fiCi(A„^^I^^-L)2.      (3.128) 


From  Lemma  B.6(ii),  RQR  =  RV^.   Thus  since  range ( Cj )  C  range (R)  and  R 
is  idempotent  we  have 


C-jfiCj  =  CjCj^  CTj  =  <Tf  I 


^l' 


The  last  equality  follows  from  (3.124).   Therefore  we  can  express 
(3.128)  as 

Var(w)  =  Var(u)  +  (A^^xl/?!  "  ^)''l 
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=  HiAiHi4  +  HiA2Hia^  +  5I^_l 


where  5  =  <^^  +  Vax'^e  ■ 


From  Theorem  3.3  we  note  that  model  (3.126)  can  also  be  written  as 

w  =  Big  +  B2^  +  f*  (3.129) 

1 
where  B^  =  HjAj ;  B2  =  HiA2;  and  e*  =  HjT  +  HJ+ (A^axlr/^  -  L)2C;e  ~  N(0,(5I^_^) 

independently  of  g  and  /?.   This  model  in  the  form  required  by  Seely 

and  El-Bassiouni  (1983,  p.  198).   We  are  now  in  position  to  construct 

an  exact  test  for  testing  EQ:aa  =  0   vs.  Ug^-.ffg^O. 

The  exact  test.   Let  us  start  with  some  notation.   SSE^^  will 

denote  the  error  sum  of  squares  associated  with  model  (3.129).   That 

is, 

SSEy  =  y'(l^_l-[Bi:B2]([Bi:B2]'[Bi:B2])-[Bi:B2]')i'.     (3.130) 

Also,  let  R^;(^|g)  denote  the  sum  of  squares  for  P   adjusted  for  a 
(treating  a   and  /?  as  fixed)  in  model  (3.129).   Thus 

Ru-(^k)=y'(^c-l-Bi(BiBirBi)y-SSEy.         (3.131) 

Since  range(Bi) C range(B2) ,  (3.130)  and  (3.131)  can  be  equivalently 
expressed  as 
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SSEy  =  y'(l^_l-B2(B^B2rB'2)c.  (3.132) 

Ry(^|g)  -y'(^2(B^B2rB^-Bi(BiBirBi)y.  (3.133) 

The  second  and  last  major  result  of  this  section  is  given  in  the  next 
theorem. 


Theorem  3.4 


/  A  ^-l 


a)  A  g-inverse  of  B'jB j  ^  A'jH'iHiA j  is  given  by  (A'jHiHjAj)  =  (AjAj) 
(j  =  l,2). 

b)  Ry(^l«)/«~Xb-a'  "°^e^  Ho:cr^  =  0 

c)  SSEy/5~x'c_b 

d)  R(^(/3|g)  is  independent  of  SSE^^ 

Proof: 

a)  Since  HjHi  =  I^_;^  and  Hilc  =  Q^_^.  It  follows  that  HiH^^Ic-^Jc 
where  Jc  =  lclc-  Also  recall  that  l^,  =  A^l^  =  Ajl^  (see  (3.107)j. 
Thus 


J  1  1  J     J-.  C   C  C^   J 


A'.H;HiA.=A'.(Ic-ijc)A 
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^?j-^^ycAj 


^?j-^^?jVjAj  01.2), 


where 


=  {f  '.I   ^rl  (3.134) 

Lb  if    j=2. 


Therefore, 


=  (A^Aj-iA^AjJ,A^Aj)(A^Aj)-(A^A.-iA^AjJ,A^A.) 


(I,-iA'jAjJ,)(A'jAj-iA'jAjJtA'jAj) 


A^A.-iA^AjV^Aj+^A^AjJ,A^AjJ,A^Aj 


A'jAj-iAyeAj+^Ay,AjJ,A'jAj 


^J^J-^^ycAj  +  ^Ay,J,Aj 


A'  A     —  Ia'.  T    a  . 


Aj(Ic-K)Aj=A'/AAj. 
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Although  (b),  (c),  and  (d)  follow  from  Seely  and  El-Bassiouni ' s 
(1983)  Equation  (3.2),  we  nevertheless  prove  these  results  for 
the  sake  of  completeness. 

b)    First  note  that  (3.133)  can  be  expressed  as 

y'(P2-Pi)y  (3.135) 

where 

Pj  =  HiAj(A'jAj)-U'jHl   (j  =  l,2).  (3.136) 

We  need  to  verify  that  (Pj  -  Pi)fvar(w)  j/<5  is  idempotent  of  rank  b- 
a  when  <^^  =  0  (see  Lemma  (B.4)j.   Recall  from  Theorem  3.3  that 

Var(w)  =HiAiA'iHi(7^  +  HiA2A^Hicr^  +  (5I^_^. 

Since  p.  is  idempotent  of  rank  t-1  (see  part  (a)  of  this  theorem, 
Lemma  (3.3),  and  (3.134))  and  range ( Pj )  C range (Pj)  it  follows 
that  there  exists  a  matrix  O  such  that 


Pi  =  P2O 


O'Pj.     (by  symmetry) 
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Thus 

PlP2  =  0'P2P2  =  0'P2  =  Pi. 

Consequently,    P2-P1    is    idempotent  and  has   rank    (b-1)  -  (a-1)  = b-a. 
Now,    since   range(HjAjAiHi)  C  range(HiA2A2Hi)  =  range(P2)  ,    we   have 

(P2-Pi)(Var(y))=(I-Pi)P2Var(y)  (PaP2  =  Pi) 

=  ( I  -  Pj )  [HjAiAlHi^  +  E^A^K'^ny^  +  ^Pj] 

=  (I  - Pi)HiA2A^Hi<T^  +  (5( I  - PjPj  (3 .  137) 

=  (5(P2-Pi)  (when   <r^  =  0). 

Formula  (3.137)  follows  since  range(HiAjAjH^)  = range(P^)  and 
(I-Pi)Pi=0.   Consequently,  (Pj  -  Pi)fvar(w)y(5  is  idempotent  of 
rank  b-a.   Thus  from  Lemma  B.4  we  have 

Ry(^l«)/'5~Xb_a    ""*1^^  Ho:<T^  =  0. 
c)    SSEjj  in  (3.132)  can  be  expressed  as 

SSE^  =  y'(I^_-L-P2)y. 


-68- 


In  the  expression  for  Var(y),  range ( HjAjA'iHi )  C  range (HjAjAjHi) 
=  range(P2).   Thus  since  (I-P2)P2  =  0  it  follows  that 
(I-P2)(Var(w)y^=  I-P2  and  therefore  has  rank  (c-1)  -  (b-1)  =  c-b. 
Thus  from  Lemma  B.4, 

SSEy/5~x'c_b- 

d)    To  show  that  R^(;5|g)  is  independent  of  SSE^^  we  need  to  verify 
that 

(P2-Pi)(var(y))(I^_l-P2)=0 

(see  Lemma  B.2).   From  (c)  we  have 


(Var(y))(I-P2)=i(I-P2) 


Thus 


(P2-Pa)(Var(y))  =  KP2-Pi)(I-P2)=0. 


The  last  equality  follows  since  range(Pi) C range(P2) 
Consequently  Ry(/?|g)  and  SSE^  are  independent. 

Ve  can  conclude  from  Theorem  3.4  that 


Rw(/?|a)/(b-a) 
"/?=  'sSE;/(c-b)  -"b-^.c-b  (3.138) 
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under  HQ:cr^  =  0.   Ve  would  reject  Hq  at  the  u   level  of  significance  if 

Fa>F  1      .  where  F  .    „  k  is  the  upper  100j/7o  point  of  the 
p    t/jb-ajC-b        i/,b-a,c-b         '^'^ 

corresponding  F  distribution.   The  fact  that  we  reject  Hq  for  large 
values  of  our  test  statistic  follows  from  the  next  lemma. 


Lemma  3 . 4 

a)  tr[(I-Fi)HiA2A^H;]>0 

b)  rank[(I-Pi)HiA2A^H'J  =b-a,  where  (I -P^HjAjA^Hi  is  part  of  the 
expression  for  (Pj  -  Pj)[Var(w) j  in  (3.137). 

Proof: 

a)  Both  (I-Pj)  and  ^^k^k'^^'^   are  n.n.d.   Thus  tr[(I -PjHiAjA^Hi]  >  0 
fCraybill  (1983,  p.  397)V   However,  since  range ( P^ )  C  range (Pj) 

=  range (HjAjA^H'i)  it  follows  that  (I -Pi)HiA2A^Hi  ^  0  (if  it  were 
zero  then  range (H^AjAjH^)  would  be  contained  in  range(Pj),  a 
contradictionV   Hence  tr[(I  -  Pi)HiA2A^H'J  >  0  fsee  Graybill  (1983, 
p.  397)]. 

b)  First  note  that 


P2HiA2A^H'i  =  HiA2A^h;, 


Thus,    since   P1P2 =  ^i 
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(I-Pi)HiA2A^Hi  =  (P2-Pa)HiA2A^Hi. 


Consequently, 


rank[(I-Pi)HiA2A^H;]=rank[(P2-Pi)HiA2A^Hi] 


<  rank(P2-Pi)  =  b-a. 


Conversely,  by  the  Frobenius  Inequality  (see  Lemma  A.l), 


rank[(I-Pi)HiA2A^H'J>rank(I-Pi)-|-rank(HiA2A^H'i)-rank(I^_;^) 


=  c-a  +  b-1-  (c-1) 


Hence,    rank[(I  -  P^H^AjAjHi]  =  b-a. 


D 


If  ^^(>0)  denotes  the  average  of  the  nonzero  eigenvalues  of 
(I-Pl)HiA2A^Hi  then  from  (3.137) 


E 


M^le) 


=  tr[(P2-Pi)Var(y)]/(b-a)=^^,7^  +  ^. 


Ve  note  that  Oa   is  an  average  since  rankl(I  —  P2)HjA2A2HU  =  b-a  by  (b)  of 
this  lemma.   Ve  now  develop  an  exact  test  for  a^. 
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3.2.4.   An  Exact  Test  Concerning  a"^ 


Introduction .   In  this  section  an  exact  test  is  constructed  for 
the  variance  component  associated  with  the  nesting  factor.   Similarly 
to  Section  3.2.3,  we  would  like  to  use  model  (3.129)  to  develop  our 
exact  procedure.   Unfortunately,  because  range ( Bj )  C range (Bj)  in 
(3.129),  we  have 


Ily(«  I P)  =  y'(lc-l  -  B2(B^B2)-B^)y  -  SSE, 


Thus  another  approach  is  necessary.   The  problem  here  is  similar  to 
the  problem  we  would  have  encountered  if  we  had  tried  to  use  the 
original  model  (3.95)  to  construct  an  exact  test  for  ag.      Namely 
because  range(X2)  C  range(X3) ,  R(/?|/i,g,7)  =  0. 

Ve  overcame  this  problem  by  first  averaging  over  the  last 
subscript,  then  transforming  the  resulting  model  into  one  that  had  a 
zero  mean  vector.   Finally,  another  transformation  was  made  based  on 
resampling  from  the  error  vector  to  arrive  at  model  (3.129).   From 
here  we  were  able  to  derive  an  exact  test  for  ag. 

The  same  idea  can  be  used  again  to  develop  an  exact  test  for  (r^. 
Now,  however,  our  starting  model  is  model  (3.129).   This  model  is 
similar  to  an  unbalanced  random  two-fold  nested  model  (without  an 
intercept  term)  in  the  sense  that  it  enjoys  all  the  same  mathematical 
and  statistical  properties  of  such  a  model. 
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Initial  development.   Our  first  step  will  be  to  "average  over 
the  last  subscript".   This  can  be  accomplished  by  considering  the  bxl 
random  vector  w  defined  as 


y=(B^B2)  B^y 


=  (A^A2)"UXw.  (3.139) 

The  last  equality  follows  from  Theorem  3.4(a). 

Next  we  transform  model  (3.139)  into  one  that  has  a  "zero  mean 
vector."   Although  E(w)=0,  it  is  still  convenient  (for  mathematical 
reasons)  to  make  this  transformation.   Let  H*  be  the  last  b-1  rows  of 
the  bxb  Helmert  matrix.   Then  HJ'HJ"  =  I^^_^  and  Hj^l^^  =  Oj^_^ .   Define  v 
as  the  (b-l)xl  vector 

Y  =  H*w.  (3.140) 

The  mean  and  variance  of  v  is  provided  in  the  next  lemma. 
Lemma  3 . 5 


0    E(v)=Ob_i 


b)   Var(v)=H*(|^Jb.)ir'^^  +  Ib_i<^^  +  Hi*(A^A2)-lH*'5  (3.141) 
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Proof: 

a)  From  Theorem  3.3,    E(w)=0^_]^.      Thus 

E(y)  =H*E(y) 

b)  Also  from  Theorem  3.3, 

Var(w)  =HiAi(T^  +  HiA2H;<7^  +  iI^_l. 

Thus, 

Var(v)  =H*[Var(y)]H*' 

=  H*(A^A2)-U^H;[Var(y)]HiA2(A^A2)-^H*' 

=  H*(A^A2)-U^H;[HiAiH^4  +  HiA2Hi<r^  +  ^I^_l] 
xHiA2(A^A2)-^H*' 


Now,    from    (3.107),    Aj  =  A2(    ®  Jk  V?-      Therefore 

^  1=1      1' 


n^lX^^J^®  Jh.y^K-  (3.142) 
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Also, 

=  H*(A^A2)-^[A^A2-iA^JcA2] 

=  H*(A^A2)-^[A^A2-iA^A2JbAiA2]   (from  (3.107)) 

=  H*.      (recall  that  H*lj^  =  0)  (3.143) 

Consequently,  from  (3.142)  and  (3.143), 

Var(v)  =  [H*(  I  J,  kH;<T^  +  H*A^Hi4 +  H*(A^A2)-U^H;5]HiA2(A^A2)-^H*' 

=  K(    ®  Jb  )HM  +  HrHr4  +  H*(A^A2)-iH*'5 
^  i=l  i^  ^ 

^  1=1  l'  ^ 

Our  last  step  will  be  to  resample  from  the  error  vector.   In 
this  case,  the  error  vector  is  R*w  fR*  =  I^_^-P2  j  since  the  error  sum 
of  squares  for  the  w  model  [see  (3.129))  is  given  by  SSE^^  =  w'(I^_j-P2)w 

Utilizing  resampling.   Noting  that  R*  is  symmetric,  idempotent, 
and  has  rank  c-b,  we  can  write  it  as 
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R*  =  C*r*C*'  (3.144) 

where  C*  is  orthogonal  and  F*  is  diagonal  whose  first  c-b  diagonal 
entries  are  equal  to  unity  and  the  remaining  entries  are  equal  to 
zero.   Furthermore  C*  and  F*  can  be  partitioned  as 


C*  =  [Ct:C*:C3*] 


r*  =  diag(I  .,1  *,0), 


(3.145) 


where 


f  =  b-l 


'?! 


J72  =c-2b+l. 


(3.146) 


Ve  note  that  q"^ +  r]^  -  ra.nk(R*)   and  r]2>0   by  (3.112)(iv).   Also,  C*,  C2, 
and  C3  are  matrices  of  orders  (c-l)  xtj* ,    (c-l)xr/2,  and  (c-l)xr/j, 
respectively,  satisfying 

Cl'Cl^l  (i  =  1,2,3) 

(3.147) 


q'C*j  =  0      (i^j) 
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Thus, 


R*  =  c;fc;f'  +  c^c^'.  (3.i48) 


Define   the    (b-l)xl   random  vector  r  as 


I  =  Y+(ALxI   *-L*W'y  (3.149) 


where  A*3^^  is  the  largest  eigenvalue  of  L*  =  H*(A2A2)  ^H*'  and 

1 
(A*3^j^I  :^  — L*)2  is  a  well-defined  matrix  with  eigenvalues  equaling  the 
'?! 

square  roots  of  the  eigenvalues  of  (Amg^^I  +  — L  ),  which  are 
nonnegative.   The  major  result  of  this  section  is  given  in  the 
following  theorem. 


Theorem  3.5 

The  random  vector  r  is  normally  distributed  with  a  zero  mean 
vector  and  a  variance-covariance  matrix  given  by 

Var(r).=  H*(  I  Ju  X*V^  +  5*I,  .  (3.150) 

^  i  =  l   i^ 


where  ^*  =  o"^  +  A* ^^"^ . 
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Proof : 


Clearly  r  is  normally  distributed.   From  Lemma  3.5  we  know  that 
E(v)=Oi_-..   E(y)  =  0   i  from  Theorem  3.3.   Thus 

E(r)=E(v)  +  (A;;^^I^,-L*)2C*'E(y)   (see  (3.149)) 


Ve  claim  that  v  =  H*w  =  (A^A2)~U^H'iW  fsee  (3.140)  and  (3.139))  is 
independent  of  C*'w  in  (3.149).   To  verify  this  consider 

(A^A2)-U^Hi[Var(y)]R*=(A^A2)-U^H;[Var(y)](I-P2) 

=  (5(A^A2)"U^H'i(I-P2)   (recall  (3.127)  and  (3.136)) 

=  0. 

The  last  equality  follows  since  rangel  (A2A2)~^A2Hj J =  range(P2)  .   Since 
range ( C* ) C range (R*)  it  follows  that 

(A^A2)-U^Hi[Var(a.)]C*  =  0. 

Thus  V  is  independent  of  C*'w  as  claimed.   Consequently,  from  (3.149) 


Var(r)  =Var(v)  +  (A;^^I  ,-L*)5cr[Var(y)]C*(A*^^I  ,-L*)^  . 

^1  '?! 
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Now,    recalling  that   [Var(w)]R* = 5R*  (see    (3.127)    and    (3.136)), 
C;^'[Var(w)]C*  =  C*'R*[Var(y)]R*Cj'  (from    (3.148)) 


6C*'R*C*  (R*    is    idempotent) 


(SCj^'Cj* 


=  ^Ib-l- 


Hence,  from  Lemma  3.5(b)  and  the  fact  that  L  =Hj(A2A2)  Hj', 


Var(r)=Var(v)  +  (A*^^I  .-L*)^ 

'/I 


^  1=1  i'  ^  'n 

^  1=1   1^ 


where  ^*  =  o-^  +  A* ax^ .  D 


We  can  infer  from  Theorem  3.5  that  model  (3.149)  can  be  re- 
expressed  as 

T=(u*    ®  u)q  +  1  (3.151) 

^   i=l  r 

1 
where  |  =  H*/?  +  H*7*  +  (A*axl  * -L*)2Cj"£*  ~  N(0,(5*Ij^_-^)  independently  of  a 

and  T*  =  (A^A2)"U^Hie*.   The  vector  e*   is  defined  in  (3.129).   Model 
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(3.151)  is  similar  to  an  unbalanced  one-way  random  effects  model  with 
zero  intercept.   We  note  that 


ran 


kfflf  ®   1,^  )<  rankf  0  Ib.)^' 


But,  since  f  0  lu  ]la  =  lK  ^^^   ^llb~-b-1'  ^*  follows  that 

ranklHj  ©  1>  J<a-1.   Conversely,  by  the  Frobenius  Inequality  (see 

Lemma  A . 1 ) , 

rank^H*  _©  Ij^  J  >  rank(H*)  +  rankf  ©  Ij^  j-rank(Ijj) 

=  b-1  +a-b 


=  a-l. 


Hence,  rankfH*  _©  li  j  =  a-l.   Ve  have  therefore  just  proven  Lemma  3.6. 


Lemma  3.6 


In  (3.151),  rankfn*  ©  1,  )  =  a-l, 
^   i  =  l   i' 


Ve  can  now  develop  the  exact  test  for  cr"^. 

The  exact  test.   We  now  construct  an  exact  test  for  Hq:<7^  =  0 
vs.  Hj^icTq^^O.   To  start  with  let  F  be  the  (b-l)xa  matrix 


F  =  H*  ©  1,  .  (3.152) 

i  =  l  "^i 
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Also,  let  SSE^  denote  the  error  sum  of  squares  for  model  (3.151). 
That  is, 

SSE^  =  r'(Ij^_-^-F(F'F)~F')r.  (3.153) 

R^(g)  will  denote  the  sum  of  squares  for  the  g  effect  in  model 
(3.151).   That  is 

Rr(s)  =l'F(f"F)~F'l-  (3.154) 

In  the  next  theorem  we  find  a  generalized  inverse  for  F'F  and  show 
that  R^(g)  and  SSE^  are  independent  and  have  scaled  chi-squared 
distributions  (the  former  having  a  chi-squared  distribution  only 
under  Hgicr^  =  0)  . 

Theorem  3.6 

(a)      A  g-inverse  of   F'F  =  (    ©  l(    )llf'H*(    ©  1,    )   is 

^  i  =  l      i'  ^  i  =  l      i'^ 


(F'F)-=.|  bTi. 


(b)  RAq)/^*~xI_i,    under  Ho:<7^  =  0 

(c)  SSE^/5*~xta 

(d)  R^(q)    is  independent  of  SSE^ . 
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Proof : 


The  proof  of  Theorem  3.6  follows  along  much  the  same  reasoning 
as  that  for  Theorem  3.4  and  is  thus  omitted.  D 

A  test  statistic  for  testing  Hq:o-^  =  0  vs.  E^^-.a'^^O   is  given  by 


^    _R^(a)/(a-l) 
*"~  SSE^/(b-a)  ■ 


(3.155) 


Under  Hq,  Pa^^a-l    b-a"  ^°'^^   *^^* 


R^(a) 


a-1 


=  6y^  +  6* 


(3.156) 


where  B^^    is  the  ave 


rage  of  the  eigenvalues  of  FF' =  H*f  ©  J.  jH^'.   If 

^  i=l   i^ 


H*  denotes  bxb  Helmert  matrix,  then 


1  1' 
4b  -b 

Hi* 


(3.157) 


Consequently, 
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H{i,\K*M<,t,\y'' 


Hi^l  +  ^'^A'"!)""; 


Hence, 


"~a-l 


^4  toi 


i  =  l 


>0. 


(3.158) 


Therefore  we  would  reject  Hq:(7-^  =  0  for  large  values  of  F^. 


3.2.5.   Reduction  of  the  Exact  Tests  when  the  Design  is  Balanced 

Introduction ■   We  now  show  that  the  exact  tests  for  a^   and  a^ 
given  in  (3.155)  and  (3.138),  respectively,  reduce  to  the  usual  ANOVA 
tests  when  the  design  is  balanced.   The  test  statistic  F^,  given  in 
Section  3.2.1,  is  the  usual  ANOVA  test  for  testing  Hq:<7^  =  0  vs. 
Ha:<T2^0. 

When  the  design  is  balanced  we  have 


b.  =b*  Vi 


c.jHc*  V(i,j) 


(3.159) 


°ijk-"*  '^(i'J'l^)- 
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Consequently , 


i 


c =  i:  c .  .  = ab*c*  (3.160) 


n— j^  =  ab  c   n 


Also,  recalling  (3.104),  (3.105),  and  (3.111) 


^l=fib*c*  =  ^a®lb*®ic 


* 


Aj^.e.l  *  =  Ia®Ib*®-c*  (3.161) 

1 J  J 


i,J,k      n       be 


Thus  the  matrix  L  defined  in  (3.115)  becomes 


J/  _  1 


h^E^KE[^^  l^_^.  (3.162) 


n 


1  .    *       • 

It's  largest  eigenvalue  is  therefore  •^max~"**  '^^^   matrix  L  ,  used  in 

n 

(3.149),  was  defined  as 

L*  =  H*(A^A2)-^H*'. 
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en  the  design  is  balanced,  L*  becomes  (see  (3.161)1 


Vh 


=  ^Ib-r  (3-163) 

The  largest  eigenvalue  of  L*,  Aj^g^,^,  is  therefore  -^.      The  matrices  P^ 
and  Pj  were  defined  in  (3.136).   When  the  design  is  balanced  they 
reduce  to 

Pi=Hi(Ia(8.Jj^*®J^*)Hi 

(3.164) 
P2  =  Hi(I^®Ij^*®J^^)Hi 

where  Jg=ilgl'g. 

Reduction  of  the  exact  tests.   Consider  first  the  vector  w 
defined  in  (3.126).   When  the  design  is  balanced  w  becomes 


y  =  y  +  (^maxIr?i-L)2C;y 


u  (from  (3.162)) 

H^y  (see  (3.113)).       (3.165) 
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Using   (3.163),    the  vector  r,    defined   in    (3.149),    reduces   to 


=  V 


(from   (3.163)) 


=  H*w  (see    (3.140)) 

=  H*(A^A2)"U'2Hiw  (see    (3.139)) 

=  -^H*(Ia(8)Ij^*(8)l'^*)H;y      (from   (3.161)).    (3.166) 


Since  the  design  is  balanced, 


fc-l-P2  =  Hi[la®Ib*®(Ic*""^c*)Ki 


and 


P2-Pi=Hi[la®(Ib*-V)®VK 


Consequently, 


Ry(^l«)=y'(P2-Pi)y        (see  (3.135)) 
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yfla^Clb^-V^^-^c*^-  (3.167) 


The  last  equality  follows  since  HjH^  =  I^  — J^,  and  J, +(I,  *  —  J,  *)  =  0. 
Also 


SSEy  =  y'(I^_-^-P2)y      (see  (3.132)) 


rHiHi[la®Ib*®(Ie*-V)K"i^ 


y'[la®Ib*®(Ic*--^c*%-  (3.168) 


The  last  equality  follows  since  J  +(1  *  — J  *)=0.   Furthermore,  from 

c   c    c 

(3.153)  and  (3.154)  we  have 


SSE^  =  r'(Ij^_^-F(F'F)-F')r 

=  r'(Ib_^-iFF')r   (by  Theorem  3.6(a)) 

=  r'(Ib-l-"r(Ia®V)"*')-     (^^^  (3.152)) 
=  r'H*[l^®(I^^,-J^,)]H*'r. 
From  (3.166)  we  can  further  reduce  SSE^  to 


SSEj  =  -^\Q'X'K[^e.  ®  ( lb*  -  \*)ft'^tQrKv 
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=  ^y'HiQ;{la®(Ib*-V)K"i^ 


where,  from  (3.166). 


Since 


Qr  =  Ia®Ib*®-c*- 


1  n/ 


?Q'r[la®(Ib*-V>^ 


Qr  =  Ia®(Ib*~V)®"^c* 


we  have 


SSE^=^y'Hi[la®( 


^b*~'^b^ 


0®  J^*lHiw 


1  ,,/ 


y'(P2-Pi)y- 


Thus, 


c*SSE^  =  R^(/3|g), 


(3.169) 


Similarly,  we  can  show  that 


Rr(?)  =  r'F(F'F)~F"l 


VHi(Ia®Jb*®V)^i- 


VPi-- 
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Thus, 


c*R^(g)=y'H;PiHiy.  (3.170) 


Since  the  design  is  balanced, 


y  =  -Tr(Ia®lK*®^.*®ln*)y-  (3.171) 


Thus    (3.167),    (3.168),    and    (3.170)    can    be    expressed   as,    respectively, 


Ry(^l«)=^y'[la«'(Ib*-V^®'^c*®"^n*]^  (3.172) 


SSE, 


y  =  j*y[^ei®\*  ®  (^c*  "  "^c*^  ®  \*}~  (3 .  173) 


and 


Rr(^)=-T-*f(Ia-Ja)®Ju*®J   *®J    *>•  (3.174) 

ncL  Dcnj- 

Therefore,  from  Khuri  (1982),  it  follows  that  n*c*R^(g) ,  n*R^(/?|g) 
=  n*c*SSE^,  and  n*SSEj^  reduce  to  the  usual  ANOVA  sums  of  squares  for 
balanced  data. 

3.2.6.   A  Numerical  Example 

The  following  data  are  taken  from  Bennett  and  Franklin  (1954,  p. 
405)  and  has  been  made  unbalanced  for  illustrative  purposes.   The 
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data  represent  a  part  of  a  statistical  study  on  the  variability  of  a 
number  of  properties  of  crude  smoked  rubber.   The  figures  given  in 
Table  3.2  represent  measurements  of  the  modulus  at  700%  elongation 
made  using  24  samples.   On  each  of  these  samples  at  most  3 
determinations  of  the  modulus  at  7007o  elongation  were  made,  resulting 
in  a  total  of  53  observations.   The  three  factors  of  interest  are 
supplier,  batch,  and  mix.   We  assume  each  factor  to  be  random.   ^i  tI'£ 
will  represent  the  £   replicate  determination  from  mix  k,  which  is 
derived  from  batch  j  of  supplier  i.   The  linear  model  in  this 
experiment  is 


yijk£  =  /^+'*i+^ij+Tijk+^ijk£' 


where  1  =  1,2,3,4;  j  =  1 ,2,  •  •  •  ,b.  ;  k  =  l,2;  «  =  1 ,2,  •  •  •  ,n  •  jj.. 


In  this 


case  bj  =  2,  b2  =  4,  b3  =  3,  and  b4  =  3.   Notice  that  Cj  4  =  2  for  all  (i,j) 
so  that  the  design  associated  with  this  experiment  is  partially 
balanced. 

We  start  our  analysis  by  testing  for  a  mix  effect.  That  is,  we 
test  for  Hg:(T?,  =  0  vs  Eg^ial.  :;^  0.  The  appropriate  test  statistic  is  F^ 
given  in  Section  3.2.1.   In  the  present  case  we  find 

F^  =  1.3110 
numerator  d.f.  =12 
denominator  d.f.  =29 
p-value  =  0.2650. 
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Thus  there  does  not  appear  to  be  any  significant  variation  due  to  the 
mix  effect. 

To  test  for  a  significant  batch  effect  we  use  F^,  the  test 
statistic  developed  in  Section  3.2.3.   For  these  data  we  find 

F^  =  0.3800 
numerator  d.f.  =8 
denominator  d.f.  =12 
p-value  =  0.9114. 

No  significant  variation  in  the  moduli  of  the  24  samples  exist 
among  batches. 

The  test  statistic  F^^  (see  (3.155)1  is  used  to  test  for  a 
significant  supplier  effect.   Ve  find 

F^  =  39.0024 
numerator  d.f.  =3 
denominator  d.f.  =8 
p-value  <  0.0001. 

There  is  highly  significant  variation  in  the  moduli  of  the  samples 
among  the  four  suppliers. 

All  calculations  were  performed  using  S-plus. 
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Table  3.2    An  example  of  an  unbalanced  random  3-fold  nested  model 


Supplier 

A 

B 

C 

D 

Mix 

1 

2 

1 

2 

1 

2 

1 

2 

Batch  I 

196 

226 

196 

196 

189 

196 

323 

262 

200 

186 
190 

156 

186 

175 

279 
251 

234 

Batch  II 

250 

248 

193 

186 

211 

196 

238 

262 

238 

249 

196 

197 
210 

190 

250 

272 

Batch  III 

204 

174 

196 

180 

273 

223 

165 

172 

197 

166 

221 

256 

194 

186 

230 

Batch  IV 

209 
221 

202 
211 
204 

(Source:   Bennett  and  Franklin  (1954,  p.  405)] 
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3.2.7.   The  Power  of  the  Exact  Tests 

Introduction.   The  approximate  power  of  the  exact  test 
concerning  cr^,  the  variance  component  associated  with  the  nesting 
factor,  is  derived  in  this  section.   Similar  expressions  can  also  be 
developed  for  the  other  two  variance  components.   We  again  use 
Hirotsu's  (1979)  results  to  develop  our  expression  for  the  power  (see 
Appendix  D) . 

Let  $Q  denote  the  power  of  the  test  for  cr^.      Then  from  (3.155) 


rR^(Q)/(a-l) 
^'^  =  ^   SSE^/(b-a)  -^:/,a-l,b-al"a 


'a 


(3.175) 


where  Hg^:o-^7^0  and  R^(q)    and  SSE^  are  defined  in,  respectively, 
(3.154)  and  (3.153).   Under  Hj^,  R^(g)  is  distributed  like  a  random 
variable  of  the  form  [see  Box(1954b)J 


Ta=^E\iXi  (3.176) 

j=l  -^  -^ 


where  the  x^'s  are  independent  chi-squared  variates  with  one  degree 
of  freedom,  and  6    •    is  the  j   nonzero  eigenvalue  of 


V^FF'al  +  F{F'FyF'S*,  (3.177) 

where  F  is  defined  in  (3.152).   Equation  (3.177)  follows  from  Theorem 
3.5.   Clearly  FF'  and  F(F'F)~F'  commute  and  rank(FF':F(F'F)"F') 
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=  rank(FF')  =  a-1 .   Thus  there  exists  an  orthogonal  matrix  P  such  that 


Pi'P'  =  (o"   oV^+lo""'  oV  (3-"^) 


where  A^;^  is  the  (a-1)  x  (a-1)  diagonal  matrix  whose  diagonal  entries 
are  the  nonzero  eigenvalues  of  FF'.  (If  PUP'  is  not  in  the  form 
indicated  in  (3.178)  then  pre-  and  post-multiply  by  permutation 
matrices  until  this  form  is  achieved.   The  locations  along  the 
diagonal  of  the  nonzero  eigenvalues  of  FF'  and  F(F'F)^F'  must  match 
exactly.)   The  j   nonzero  eigenvalue  of  U,  S^^,    is  therefore 


^aj=Wa  +  ^* 


where  A  •  is  the  j   diagonal  entry  of  A^^  ( j  =  1 ,2,  •  •  •  ,a-l)  . 

The  approximate  power.   To  obtain  the  approximate  power  (see 
Appendix  D)  let 


MSEr  =  SSE7./(b-a)  (3.179) 


R^(g)/df 


_^^a-l) 

-   d*f  ^i',a-l,b-a-  (3.181) 
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2  2 

If  £„  =  — ^  =  — = ^- ,    then    in   this   case 

-5         '^^  +  Amax^ 


d*^6*  J-^ 


t:\A^.^^+i)^ 


a-1 
J 


1=1        -^ 


(3.182) 


f  = 


a-1 


.1=1        -^ 


j=l 


(3.183) 


Furthermore,    define 


f  2  =  b-a 


j=i 


e\a^.^^  +  i)- 
j=i   -^ 


=^  -1, 


e\a^.^^  +  i)' 
j=i   -^ 


(3.184) 


Then  "^^   fin  (3.175)1  can  be  approximated  by  using  the  formula  given  in 
(3.86)  and  making  the  appropriate  substitutions  for  H,  h,  d*,  f,  f2, 
and  p.      We  note  that  "d*"  in  (3.182)  plays  the  roll  of  "c"  in  (3.86). 
Since  ^  —  <^'y  + ^max'^e   ^®  ^®®  that  the  power  of  the  exact  test  will 
depend  on  the  design  parameters  A„,g^j^,  A*3^jj,  A^j ,  •  •  •  ,A   „_-i  j  the  level 
of  significance;  and  the  ratios  cr^/cr^,    (P'qI<t\,    and  ci.ja^   of  the 
variance  components. 
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The  design  parameters  A^,^^  and  Aj^^^j^.   Bounds  for  Aj_g,v  and  A 


are  given  in  the  next  lemma. 


Lemma  3.7 


(a)   The  largest  eigenvalue,  A^g^,^,  of  the  matrix  L  in  (3.115) 
satisfies  the  double  inequality 


c  ^       liT-rr  < -^max  ^ -7TT  (3.185) 

i,j,k   iJ^        n^^ 


where  n^  "^  =  min  (n-  -i,), 


(b)   The  largest  eigenvalue,  A*g^j^,  of  the  matrix  L*=H*(A2A2)  H*'  (see 
(3.149))  satisfies  the  double  inequality 


^  E.c|T^^max<^  (3-186) 


where  c^  ^  =min  (c-  •), 


Proof; 


(a)   If  e^3^j^(A)  denotes  the  largest  eigenvalue  of  the  matrix  A,  then 


■^max  —  ^max(^)  —  ^max(^i^^i) 


^maxC^^H^HJ 


•'V^.'- 
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<  en,ax(K)emax(I^i%) 


emax(K)einax(HiH'i) 


_„    (i(\  (Hi  has  orthonormal  rows) 


(3.187) 


.(0 


k. 


■         1    .    ^h.t  \    is  greater  than  or  equal  to  the  avera; 
It  is  also  true  that  A^^x  ^^  gicdi-c 

eigenvalue  of  L.   That  is, 


ge 


An.ax>^tr(L)=^tr(HM) 

=  ^tr[K(Ic-^Jc)] 

=  1  [tr(K)-itr(KJe)] 
c-1 


=  ^[tr(K)-il'cKlc] 


1 
c-1 


1    1 


V-    1  _1  V   .R-i-— 


_1   V   -1 
-c  .  2^   n- 


jk 


(3.188) 


Thus,  from  (3.187)  and  (3.188) 


1   Y^     1 ^  \         < 1 — 

^   E  ,  nr^S^maxS  (j) 
i,j,k   iJk        n^ 
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(b)   Recalling  that  (A^A2)-' =  ®  cT^.  and  H*  has  orthonormal  rows  (see 
(3.105)  and  (3.140)),  we  see  that  the  proof  of  (b)  will  follow 
similarly  to  that  of  (a). 

The  value  of  X^^^x  ^^^^    "«*  depend  on  which  (c-l)xc  suborthogonal 
matrix  is  used  in  (3.113)  (see  also  (3.115)).   To  see  this  let  H  be 
any  other  cxc  orthogonal  matrix  whose  first  row  is  :^  l'^-      Let  H^  be 
the  (c-l)xc  matrix  resulting  from  the  deletion  of  the  first  row  of  H. 
Then  there  exists  a  (c-l)x(c-l)  orthogonal  matrix  M  such  that 

Hi=MHi. 


Consequently, 


emax(HiKH'i)=en,ax(MHiKHlM') 

=  e„ax(HxKHi)    (M'M  =  I^_i) 


—  -^max ' 


Similarly,  A*ax  is  invariant  to  the  choice  of  which  (b-l)xb 
suborthogonal  matrix  is  used  in  (3.140). 
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3.2.8   Concluding  Remarks 

As  long  as  two  rank  conditions  are  satisfied,  the  results  given 
in  Seely  and  El-Bassiouni 's  (1983)  Section  3,  allow  us  to  conclude 
that  an  exact  test  for  the  second  to  last  stage  effect  in  unbalanced 
nested  designs  can  be  constructed  by  comparing  the  mean  square  for 
that  effect  (adjusting  for  all  the  other  effects  in  the  model)  to  the 
error  mean  square  for  the  model.   Such  a  test  would  be  a  Vald's  test. 
Thus,  the  test  for  <r„  given  in  Section  3.2.1  is  a  Vald's  test. 

The  above  observation  provided  the  motivation  for  the  two 
transformations  introduced  in  (3.176)  and  (3.149).   Each  of  these 
transformations  was  based  on  resampling  from  the  error  vector.   The 
first  reduced  the  analysis  of  the  unbalanced  3-fold  nested  random 
model  to  that  of  an  unbalanced  2-fold  nested  random  model.   While  the 
second  reduced  the  problem  to  that  of  analyzing  an  unbalanced  1-fold 
nested  random  model.   In  either  case,  the  exact  test  was  based  on 
comparing  the  mean  square  associated  with  the  second  to  last  stage 
effect  in  the  new  (transformed)  model  to  the  error  mean  square 
associated  with  the  new  model.   Consequently,  both  of  these  exact 
tests  are  Vald's  tests. 

Although  not  pursued  in  this  dissertation,  the  extension  to  more 
general  unbalanced  nested  models  should  offer  no  resistance.   A 
partial  extension  is  examined  in  Section  3.4.   There  we  study  a 
general  unbalanced  random  nested  model  where  the  imbalance  affects 
the  last  two  stages  only. 
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3.3  The  General  Unbalanced  Mixed  Model  with  Imbalance 
Affecting  the  Last  Stage  Only 


3.3.1    Introduction 

Khuri  (1990)  developed  exact  tests  for  testing  hypotheses 
concerning  the  variance  components  in  general  unbalanced  random 
models  where  the  imbalance  affected  the  last  stage  only.   This 
extended  the  work  of  Khuri  and  Littell  (1987).   Gallo  and  Khuri 
(1990)  constructed  exact  tests  for  both  the  random-  and  fixed-effects 
in  an  unbalanced  mixed  two-way  cross-classification  model.   In  this 
section  we  extend  the  work  of  Gallo  and  Khuri  (1990)  by  considering  a 
general  unbalanced  mixed  model.   We  will  require  that  the  imbalance 
affects  the  last  stage  only. 

After  some  preliminary  development  we  start  our  analysis  by 
studying  the  random  effects  in  the  model.   Next  we  construct 
procedures  for  the  fixed-effects.   Finally,  we  reexamine  our  model 
assuming  only  a  few  cell  counts  are  large  relative  to  the  smallest 
cell  frequency.   Under  this  scenario  we  will  employ  the  results  in 
Khuri  (1984)  to  construct  exact  simultaneous  confidence  intervals  on 
estimable  linear  functions  of  the  fixed-effects  parameters. 

In  our  construction  of  the  exact  procedures  we  will  closely 
follow  the  terminology  and  notation  laid  out  in  Zyskind  (1960),  Smith 
and  Hocking  (1978),  and  Khuri  (1982,  1984,  1990). 
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3.3.2   Preliminary  Development 

Introduction.   A  mixed  model  for  a  general  unbalanced  design 
whose  imbalance  arises  from  unequal  cell  frequencies  in  the  last 
stage  only  can  be  written  in  the  form 

ye=t'^6.Ce-)^'6  (3.189) 

1=0  1^  1' 

where  ^ =  (kj  ,k2,  •  •  • ,kg)  is  a  complete  set  of  subscripts  that  identify 

a  typical  response,  y.   The  sets  6-    and  d-    are  the  sets  of 

nonrightmost  and  rightmost  bracket  subscripts  fsee  Khuri  (1982)j, 

respectively,  associated  with  the  i   effect  7.  ,-x    ,  (  i  =  0  , 1 ,  •  •  •  ,1/)  . 

^i(^i) 

When  i  =  0  we  define  6^  =  9^  =  (j),    the  empty  set,  and  the  corresponding  7 

is  the  grand  mean.   We  assume  that  7.  ,-^   .,  for  i  =  0, 1 ,  •  •  •  ,i^-p 

^i(^i) 

(0  <  p  <  t/) ,  are  unknown  parameters  while  7.  ,-^    ,,  for  i=j/-p  +  l, 
and  £g   are  independent,  normally  distributed  random  variables  with 
zero  means  and  variances  '''w.rj+i  5  "  "'' j""^?  ^nd  ""e;  respectively. 

Since  the  design  is  balanced  except  for  its  last  stage,  the 
ranges  of  subscripts  kj,k2,---,kg  can  be  expressed  in  the  form 


fl,2, •  •  • ,a.      for  j=l,2,  •  •  •  ,s-l 

k.=^  o  -f    •  (3.190) 

J    1,2, •  •  •  ,n_      for  j=s,  ^ 


where  t  =  (kj  jkj,  •  •  •  ,k   ..  )  consists  of  the  first  s-1  subscripts  of 
We  assume  that  the  data  set  contains  no  missing  observations. 


5"? 
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Let  us   define   the   set  T  by 


T={r=(ki,k2,---,kg_i):    k.  =  l,2,.--,a.;    j  =  1 ,2,  •  •  ■  ,s-l}  .         (3.191) 


Then  c,  the  number  of  elements  in  T,  is 


s-1 
c  =  n  a..  (3.192) 

j=l  J 


Also,  let  ip-  =  (^8 .  lO  ■)    be  the  set  of  subscripts  associated  with  the  i 

effect  which  results  from  concatenating  the  sets  of  0-    and  6^ 

(i  =  0,1 ,  •  •  •  ,1/)  ;  V"?  is  the  complement  of  ip-    with  respect  to  r 

(i  =  0,1 ,  •  •  •  ,1/)  .   The  complement  is  with  respect  to  r  and  not  6   since 

kg  only  appears  in  the  Cg   term  of  (3.189). 

Matrix  formulation.   In  matrix  notation  model  (3.189)  can  be 
expressed  as 


y=i:Xi/3;+e,  (3.193) 

~  i=0 


where  y  is  the  nxl  vector  of  observations  (n  :=  total  number  of 

observations =  52  ^t) >    ^i  ^^  ^^   nxc-  ( i = 0, 1 ,  ■  •  •  f)  matrix  of  ones  and 
reT 

zeros;  /?•  is  a  vector  consisting  of  the  c-  elements  of  7.  .-^   . 
-1  1  o'^^y^j 

(i=0,l,---,i');  and  e    is  the  nxl  vector  of  random  errors.   The 
integer  c-  is  given  by 


f    1        for  i  =  0 

c.  =«^   „       -P    •   1  o  (3.194) 

1      n  a-   for  1  =  1, 2, •••,1/.  ^      ' 
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The  assumptions  made  earlier  concerning  the  7»  ,-^   ,  (i  =  0, 1 ,  •  •  •  ,i/)  can 
now  be  restated  as 


(i)     y8Q,/0j,  •  •  •  ,;5j^_p  are  unknown  parameter  vectors  for 
0  <  p  <  j^. 

(ii)     The  vectors  ^^_-q±.\i  '  •  •  i?i/t    ^.nd  €  are  independent  and 
normally  distributed  random  vectors  with  zero  mean 
vectors  and  variance-covariance  matrices  given  by 

Var(^.)  =(t|Ic.     for  i  =  i/-p+l ,  •  •  •  ,i/ 

Var(£)=<72l„. 

Besides  these  assumptions  we  will  also  assume 

(iii)    the  model  used  is  model  (3.189)  for  model  (3.193)1 


(iv)     n  >max  2c-  X;  m- ,  2c-  Y.     m- ,  2c-  Y.     ^\h  (3.195) 

^   i=o  '    iG:f<=      ieg-c  1^ 

where  m-  ,  f*',  and  S"^  will  be  defined  later. 


Assumption  (iv)  (along  with  the  other  assumptions)  are  needed  to 
insure  that  our  procedures  are  valid.   Later  we  will  strengthen 
Assumption  (iv)  by  assuming  that  n  >  2c  (see  Section  (3.3.7)1. 

From  Assumptions  (i)  and  (ii)  it  follows  that  model  (3.193)  can 
be  written  as 
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y  =  Xg  +  Zh  +  e  (3.196) 

where  X  =  (XqiXj  :  •  •  •  :X^_p)  ;  Z  =  (X^_p^j  :Xj^_p^2=  '  '  '  =^J^)  5 

g  =  (l9|>:^j:  .  •  •  :/?^   )'  is  the  vector  of  fixed  effects  and 

h  =  (/?'   ,  I  :/3'    r,:  •  •  •  :/?J^)'  is  the  vector  of  random  effects.   It  is 

easy  to  verify  that 

E(y)=Xg 

(3.197) 

fi  =  Var(y)=  _  £   <7|X.X'.  +^^1^^. 
i=j/-p+l 

Cell  means.   If  it  were  not  for  the  e    term,  which  consists  of  the 
n  elements  of  Cg,    model  (3.193)  or  (3.196)  would  represent  a  general 
balanced  mixed-effects  model.   Various  results  concerning  general 
balanced  mixed  models  can  be  found  in,  for  example,  Smith  and  Hocking 
(1978)  and  Khuri  (1982,  1984).   The  idea  then  is  to  reduce  the 
analysis  of  the  unbalanced  mixed  model  to  that  of  a  balanced  model. 
To  that  end  let 

kg-l 

if  y  is  the  vector  consisting  of  the  c  values  of  y^,  rGT,  then,  from 
(3.198),  we  have 

y  =  Dy  (3.199) 
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where 


'u^-lu/ 


D=  (V'¥)-'W' 


and  W  is  the  nxc  block  diagonal  matrix 


w=  e  1, 


Clearly  5 


D=  ®   ir-l'n  •  (3.200) 


Now,  from  (3.189)  and  (3.198)  we  have 


1/ 

■i5i'^i(^i) 


yr=  E7«  ,^  ^+fr.  ^^T  (3.201) 


where 


1  ''^ 


€T. 


ks=l 


Since    E  7^    /^   -,    in    (3.201)    is   of   the   same   form  as    in   a  balanced 
i=0  ''i^-^'i^ 

model,    we   can   express    (3.201)    in   matrix   form  as  (see  Khuri    (1982)1 


yr=i:Hi/?i+I  (3.202) 

i=0 
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where  the  cxc-  matrix  H-  can  be  expressed  as 


s-1 
H.  =  0  L.«,    i=0,l, •••,!/  (3.203) 

^  £=1  ^*^ 


and  L-«  is  given  by 


L;/;  =  S        .   ,    ,r  (3.204) 

i«   Ua^     for  k^Gt^? 


(i  =  0, 1 ,  •  •  ■  ,J^;  £  =  0, 1 ,  •  •  •  ,8-1)  .   Furthermore,  if 


H_  (Hq:Hj:  •  •  •  :Hj^_p) 


^-(^z/-p+l=^i/-p+2=  •  •  •■■K) 


then  from  (3.202)  we  obtain 


Define  the  cxc  matrices  A-  (i  =  0, 1 ,  •  •  •  ,i')  as 


Then  from  (3.203)  and  (3.204)  we  have 


(3.205) 


y  =  Hg  +  Fh+I.  (3.206) 


A.  =H.H'.,    i  =0,1,  ••-,»/.  (3.207) 


s-1 
A.  =  ®  M.«,    i=0,l, •••,!/  (3.208) 

^  £=1  ^'^ 
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where 


fla.    for  kg€Vi 

M^/»  =  S,      .   ,    ,r  (3.209) 

i2   \j      for  k^€^9 


(i  =0,1,  •  •  •  ,1/;  £=  1,2,  •  •  •  ,s-l).   Hence 


where 


E(y)=Hg=EHi/?i 
~   i=0 


E  =  Var(y)=   £   a^A.+cr^K 
i=j/-p+l 


(3.210) 


K=  e  nl^  (3.211) 


It  can  be  easily  verified  that  A- A  •  =  A  -A-  for  all  i,j  =  0,l,---,i^fsee 
Khuri  (1982,  p.  2910)].   Consequently,  there  exists  an  orthogonal 
matrix  Q  of  order  cxc  such  that 

QA.Q'  =  A.,  1=0,1, ■■■,u  (3.212) 

where  A-  is  a  diagonal  matrix  (see  Lemma  A. 5).   The  construction  of  Q 

is  described  below. 

u 

The  matrix  Q.   Recall  that  Yl  la   /■'a    \    ^°  (3.201)  is  not  affected 

i=0  ^i(-^i>' 

by  the  imbalance  in  the  last  stage  of  the  design.   Thus  it  can  be 
thought  of  as  representing  a  balanced  mixed  model  with  one 
observation  in  each  r-cell.   Let  P-  be  the  cxc  matrix  associated  with 
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the  sum  of  squares  for  the  i   effect  in  the  derived  model 


^^=So\Chr  <'•'"'' 


Some  properties  of  P- ( i  =  0, 1 ,  •  •  •  ,i/)  are  provided  in  Khuri  (1990, 
p.  181). 

Let  rank(P- )  =  m-  (  i  =  0, 1 ,  •  ■  •  ,i^)  ,  then 


Em.  =  i:rank(P.) 
i=0    i=0 


1/ 
:  51^  tr(P-)     (P-  is  idempotent  for 
i:=0 

i=0,l,-  •  ■,iy) 


*<?o^') 


tr(Ic)  (see  Khuri  (1990)  Lemma  3.1(iii)) 


Let  Q-  be  a  matrix  of  order  m-  xc  whose  rows  are  orthonormal  and  span 
the  row  space  of  P-  (i  —  0,1 , ■  ■  •  ,u) .      The  rows  of  Q-  can  be  obtained 
by  orthonormal izing  any  m-  linearly  independent  rows  (or  columns)  of 
P-  (i  =  0,1 ,  •  •  •  ,t/)  .   The  following  lemma  is  proved  in  Khuri  (1990). 
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Lemma  3.8 


The  matrices  Qn,Q]^,  •  •  •  ,Qt^  have  the  following  properties: 


(-)     «0  =  M 


(b)   Q.Q'.=I„.,   i=0,l, •••,!/ 

Q.q'j  =  o,  i^j 


(0,        V-i?:^,, 

(c)   A.q'.=<^  J    i=0, 1,---, J.;  j  =  0,l, •••,!/, 


where  b-  is  the  number  of  ones  in  a  typical  column  of  H-, 


j  =0,1,  •  ■  •  ,t/  (see  (3.203)V 


A  consequence  of  Lemma  3.8  is  that  if  ipi    represents  a  set  of 
subscripts  associated  with  a  random  effect  and  r/j  ■    represents  a  set  of 
subscripts  associated  with  a  fixed  effect,  then  Q.A-  =  0  which  implies 
q.Hj  =0  (i  =i/-p+l,  •  •  •  ,{/;  j  =0,1,  •  •  •  ,i/-p).   Hence  from  (3.205), 
QjH  =  0  (i  =i/-p-l-l,  •  •  •  ,1/).   The  matrix  A-  was  defined  in  (3.207). 

Write  the  cxc  matrix  Q  as 


Q  =  [Q(,:Q;:---:Q;,]',  (3.214) 

then  from  Lemma  3.8,  Q  is  orthogonal  and  simultaneously  diagonalizes 
AqjAj  ,  •  •  •  ,Aj^.   Furthermore,  from  (3.210), 
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Var(Qiy)=       E       ^'^QiA -Q'^  +  (T^Q.KQ'.  (3.215) 


j=i/-p+l 


(i=0,l, ••■,«/).      However ,    from  Lemma  3 . 8 


=  <5ilm.  »         i  =0,1,  •  •  •  ,J/ 


(3.216) 


where 


6.  =  Z    b.^2        i=o,l,---,./ 


(3.217) 


V^  =  {j:j/-p+l  <j<i^,V'iCV'J,    i=0,l, •••,!/. 


Thus  from  (3.215)  we  have 


Var(Q.y)=5.I„.+(r2Q.KQ'.,   i  =  0,1 ,  •  •  •  ,i/.    (3.218) 


Also,  for  i  7^  i' 


Cov(Q.y,y'q'.,)=Qi 


E^  .^jAj+cr^K 


j=i^-p+l 


Q'i, 


=  ^31 


E   crU.Q'^  +  a^KQ', 
j=»/-p+l  J  J  ^       ^ 


=  <r2Q.Kq'. 


(3.219) 
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3.3.3   Analyzing  the  Variance  Components 

Introduction.   In  this  section  we  develop  some  independent,  chi- 
squared  distributed  sums  of  squares  that  can  be  used  to  test 
hypotheses  concerning  the  variance  components.   These  sums  of  squares 
are  analogous  to  the  sums  of  squares  associated  with  the  random- 
effects  in  an  ANOVA  table  for  a  balanced  mixed  model.   In  fact  these 
sums  of  squares  will  reduce  to  the  usual  ANOVA  sums  of  squares 
(associated  with  the  random  effects)  when  the  design  is  balanced. 

The  key  to  the  construction  of  these  sums  of  squares  is  the  idea 
of  resampling  from  the  error  vector  (see  Appendix  C).   It  should  be 
noted,  however,  that  resampling  is  not  a  "cure-all".   Resampling  does 
not  guarantee  that  we  will  be  able  to  produce  an  exact  test  for  all 
the  variance  components  of  the  model.   This  should  not  be  too 
surprising,  though,  since  even  in  certain  balanced  models  the  same 
phenomenon  occurs. 

In  designs  with  imbalance  affecting  the  last  stage  only, 
resampling,  in  some  sense,  counteracts  the  imbalance  by  reducing  the 
analysis  of  the  original  unbalanced  model  to  that  of  a  balanced  one. 
Consequently  we  should  not  expect  to  do  any  better  (in  terms  of 
constructing  exact  tests)  when  utilizing  resampling  in  unbalanced 
models  than  if  the  design  was  balanced  to  begin  with. 

Initial  development.   Recall  the  definition  of  Q  in  (3.214). 

Let  Qj,  (r  for  random)  be  the  Ic— J^m-lxc  matrix 

^   1=0  ^ 
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qr=[Q'^-p+l  =  Q:.-p+2=---=W-  (3.220) 

Also,  define  the  (c— ^m-jxl  vector  Up  as 


ur  =  qry-  (3.221) 

The  use  of  the  matrix  Q^,  causes  Uj,  to  have  a  zero  mean  vector.   In 
fact,  it  is  easily  verified  that  (recall  that  Q-H^O,  i  =  iz-p+l ,  ■  ■  ■  ,1/) 

E(up)  =  QrHg  =  0 

(3.222) 

Var(uj,)  =   e   «5ilm.  +<^eGj. 
i=i/-p+l    1 


Gj.  =  Qr-Kq'p.  (3.223) 


Utilizing  resampling.   The  error  sum  of  squares  for  the  original 
unbalanced  model  in  (3.189)  can  be  expressed  as 


SSE=E(y^-yr)'=  E    E  (y^-Yr)'        (3-224) 

e  reT  ks=i 


where,  recall,  0 =  (t ,kg)   and  y^  is  defined  in  (3.198).   In  matrix  form 
(3.224)  becomes 

SSE  =  y'Ry  (3.225) 
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where 

R=Ijj-V(W'V)-V  (3.226) 

and  V  is  the  nxc  matrix 


V=  e  In  •  (3.227) 


In  Khuri  (1990),  the  following  lemma  is  proved  (see  also  Lemma  B.6). 
Lemma  3 . 9 

(a)  R  is  symmetric,  idempotent,  and  has  rank  n-c. 

(b)  DR=:0,  where  D  is  defined  in  (3.200). 

(c)  RX-=0,  i=0,l, •••,!',  where  Xn  ,X^ ,  •  •  •  ,Xj^  are  the  matrices  in 
(3.193). 

From  (3.197)  and  Lemma  3.9  we  can  conclude  that  y  and  SSE  are 
independent.   Furthermore 

Rn/(Tl  =  R  (3.228) 

thus 

SSE/<72~X2_^.  (3.229) 
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Now,  since  R  is  symmetric,  idempotent,  and  has  rank  n-c,  we  can 
express  it  as 

R  =  CrAr.Ci-  (3.230) 

where  Cj.  is  an  orthogonal  matrix  and  Ap  is  a  diagonal  matrix  with  n-c 
ones  and  c  zeros.   Without  loss  of  generality,  we  can  partition  Cj, 
and  Ap  as 


Cj.  —  [Cj-i:Cj,2:Cj.3J 


Ar  =  diag(I^^^,I^^^,0) 


(3.231) 


where 


i=0 


Z/-P 

7jj.2  =  n-2c+  X;  m, 
i=0 


(3.232) 


and  Cpi ,  Cj.2>  s-fd  C^^   are  matrices  of  orders  nXTj^^,  nXT]j,2>    a^nd  nxc, 
respectively.   Note  that   rj^^  +  1/^,2  =  n-c  =  rank(R)  .   Also  77^2  >  0  by 
Assumption  (iv)  (see  (3.195)).   Without  this  assumption,  tj^j  may  be 
nonpositive  which  would  invalidate  our  methodology. 
Since  Cj.  is  orthogonal 

C;-iCpi  =  I   (i  =  1,2,3) 

(3.233) 
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Consequently 


R  =  CpjCj-j -f  Cj,2Cj-2  (3.234) 


and 


SSE  =  y'Ry  =  j^'C^iC^-jy  +  /CrjC'ray 


—   obIl/pi    -j-  SoCj-pO 


(3.235) 


where 


SSE^.  =y'C^.C;.y,    i  =  l,2. 

Clearly  y  is  independent  of  both  SSEj,^  and  SSE^-j-   Also  since 
C^iR  =  C^i  (i  =  l,2),  it  follows  that 


c^iC;,in/<72  =  c^.(c;iR)n/<r2 


=  C^iC;i(Rfi)/<T2 

=  ^ri^ri^    (see  (3.228)) 

=  C  C'.,   i  =  l,2. 
ri  ri '       ' 

Thus  C^^C'^^n/aj   is  idempotent  (i=l,2)  and  CnCi-ifiCj-jC'r-j  =  0 .   Hence 
SSEj^j^  and  SSEj,2  are  independent  with 
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(3.236) 
SSEj.2/<T^~xJ^2- 

The  exact  tests.   Define  the  ij^-^xl   random  vector  Wj,  as 

u-r  =  Ur  +  (Ar,maxl,,^i  -Gr)^C^iy  (3-237) 

where  Gj.  was  defined  in  (3.223);  Aj,  j^^y^    is  the  largest  eigenvalue  of 

Gj,;  Uj,  was  defined  in  (3.221);  and  C^j  was  defined  in  (3.231).   The 

1 
matrix  i>>r ,ma.x^T]^^-^r)'^   is  well-defined  since  >^r ,ma.x^r]^i-^r   is 

positive  semidefinite  (p.s.d.). 
Partition  Up  in  (3.221)  as 


yr  =  (y'i.-p+r  •••=«!/)' 


=  (-yX-p+v  •  ■  ■ --yKy  (3-238) 


where  u-  is  of  dimension  m- ;  i  =  u-p+1 ,  •  ■  ■  ,1/.      Partition  Wj,  just  like 


Up.   That  is 


Vr=(v'i:---:v'uy  (3-239) 


where  w^  is  of  order  m-xl,  i  — u-p+l , ■ ■ ■ ,u .      The  major  result  of  this 
section  is  now  presented. 
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Theorem  3.7 

(a)  E(y.)=0,  i=u-p+l,-  ■  ■  ,u. 

(b)  yj,_„^]^  J  •  •  •  >Wj^  are  independently  distributed  as  normal  random 
vectors  with  u-    having  variance-covariance  matrix 

Var(y.)  =  (^j  +  A^. ,n,ax'^^)Im.  '  i  = '^"P+l '  •  "  '  ''^-  (3.240) 

(c)  Wj,  is  independent  of  SSE^j- 
Proof: 

(a)  From  (3.222),  E(ui,)=0.   Also  from  Lemma  3.9(c),  RX 

=  R[Xq:  •  •  •  :X^_p]  =0.   Thus 

Ci-iX  =  (C'r-iR)X  =  C'i,i(RX)  =  0. 

Consequently  the  result  follows  after  noting  that  E(y)  =  Xg  [see 
(3.197))  and  recalling  (3.237). 

(b)  Clearly  w^   is  normally  distributed.   We  claim  the  u^,  is 
independent  of  C^-jy.   Now,  since  DfiR  =  0  (y  is  independent  of 
SSE)  it  follows  that  DQCpi  =  0  (range(Cr.i)  C  range(R)V   Thus  y  is 
independent  of  C^jy.   Therefore  Uj,  is  also  independent  of  C^jy  as 
it  is  a  function  of  y  fsee  (3.221)].   Consequently 
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Var(yr)=Var(ur)  +  (Ar^„^^I^^^-Gr)2Ci.,fiCri(A^^„^^I^^^-GJ 


But,    from   (3.228), 


C^jQCp^  -  CpiRf2RCj,i  =  Cri(a£R)Cj-i  =  '^c^tj^^  • 


Hence,    from   (3.222), 


i=j/-p+l  1  '  /ri 


i=j/-p+l   ^      1  '  Tl 

Consequently   yj^.r, .  i  »  ■  •  •  ^Wj/  are    independent   and 

Var(y.)  =  (^i  +  Aj. ,inax<^€)Iin.  '    i  = ''-p+l ,  •  ■  •  ,i'. 

(c)   Since  Up  is  independent  of  SSE,  it  follows  that  it  is 

independent  of  SSE^-j  (range ( Cj-j )  C  range (R)j.   Also,  since  SSEj,^ 
and  SSEp2  are  independent,  it  follows  that  C^^y  is  independent  of 
SSEj-j-   Thus  Wj,  and  SSE^j  are  independent.  D 


Let  us  define  SS  •  as 
n 


SSj,i=y'iyi,   i=j/-p+l,- •  •,!/.  (3.241) 
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Then,  from  Theorem  3.7, 


(i)   SS      , -1  ,  •  •  •  ,SSj.j/,  and  SSE^^j  are  mutually  independent, 
(ii)  SS^./(5.  +Aj,^n,a^<72)~xra..  i  =  u-p+1,  ■  ■  ■  ,u. 

Thus,  analyses  concerning  the  variance  components  <''^_p,  i  ,  •  •  •  ,<7jy,  can 
proceed  using  SS   i,_r>+1  '  '  '  '  ^SSpj,,  and  SSE^j-   To  illustrate  the  idea 
let  us  suppose  we  wish  to  test  HqIct-  =0  vs  Eg^:cr-^0,    i  —  i/-p+l ,  •  ■  ■  ,i/ . 
Also,  suppose  that  there  exists  a  j(  ^  i )  G  {i/-p+l ,  •  •  ■  ,i/-(-l}  such  that 
under  Hq:o--=0 


SS 


ri 


^j  +  '^r,max<^£' 


(3.242) 


(For  notational  convenience  take  SS    ,..  =SSEj,25  ^i/+l=^'  ^^^ 
m  J  =  ry^j/^r  max*)   Then  a  test  statistic  for  this  hypothesis  is 


_SS^./m. 
ri-SS,ym/ 


Under  Hq,  F«^~F[n.  m  •  •   From  (3.217)  we  would  reject  Hg  for  large 
values  of  F  •. 

If  no  such  j  exists  satisfying  (3.242)  then  no  exact  test  can  be 
developed  based  on  our  methodology.   In  such  a  situation  we  must 
resort  to  Satterthwaite' s  procedure  to  construct  an  approximate  test 
for  testing  this  hypothesis. 
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3.3.4   Analyzing  the  Fixed-Effects  Parameters 

Introduction.   In  this  section  we  concentrate  on  the  fixed 
portion  of  the  model  (3.189).   When  possible,  exact  tests  will  be 
developed  to  test  hypotheses  about  the  unknown  parameters  in  the 
model.   When  no  such  exact  test  can  be  developed  we  will  utilize 
Satterthwaite' s  procedure  (after  making  an  appropriate  transformation) 
to  develop  an  approximate  test. 

Initial  development.   To  begin  with,  let  u  be  the  cxl  random 
vector  defined  as 


u  =  Qy 


(3.243) 


where  Q  is  the  cxc  orthogonal  matrix  defined  in  (3.214).   If  we  let 

J/-P 
Q-^  (f  for  fixed)  be  the  ^m-  xc  matrix 

i=0 


Qf=(Q|):Qi:---:Q'^_p)', 


(3.244) 


then,  recalling  (3.220),  we  can  write  (3.243)  as 


u  = 


say 


Uf 


(3.245) 


where  u^ = (uq:u^  :  •  •  • :u^_p)'.   It  is  easily  verified  that 


E(u)  = 


QfHg 
0 


Var(u)=  e  *:!„,.  +  (r^G 
i=0  ^   1 


(3.246) 
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where 

G  =  qKq'  (3.247) 

and  H  and  K  are  the  matrices  defined  in  (3.205)  and  (3.211), 
respectively.   Formula  (3.246)  follows  from  (3.215)-(3.219)  and  the 
fact  that  Q-H  =  0,  i  —  i^-p+l,  •  ■  ■  ,1/  (see   Lemma  3. 8(c) j.   Now,  from 
(3.246)  it  follows  that 

Var(u^)  =  e  <5-In,.  +  (r^G^  (3.248) 


where 


G_f  =  q^Kq'^.  (3.249) 


Also, 


Cov(u^,u'j,)  ^cr^q^Kq^.  (3.250) 

Thus  u^  and  u^  are  not  independent.   (They  would  be  if  the  design  was 
balanced  (see  Lemma  3.8(b))j. 

Now,  recall  that  m-  =  rank(P- )  =  rank(q-  ),  i=0,l,---,i^  (see  the 
discussion  after  (3.213)1.   Also,  let  z  be  the  cxl  vector  consisting 
of  the  c  z^'s  from  the  derived  model  (3.213).   Then,  from  Khuri's 
(1990)  Lemma  3.1  (iii),  we  can  express  the  total  sum  of  squares 
associated  with  model  (3.213)  as 
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z'z  =  zff:P.)z=  Ez'P.z.  (3.251) 

\=0  "■'      i=0 

Since  Q  is  orthogonal,  we  can  equivalently  express  (3.251)  as  (see 
(3.214)) 


z'z  =  z'Q'qz  =  zfE  q'.q.V  =  E  z'Q'iQi?. 

\=0    '       i=0 


Consequently,  since  QjQ:  is  idempotent  and  range(P- )  =  range(Q'- )  , 


p.  =Q'.Q.,   i  =0,1,  ••-,</.  (3.252) 


Lemma  3.10 


(a)   rank(q^H)=mp  i  =  0, 1 ,  •  •  •  ,iv-p, 


J/-P 
(b)   rank(H)  =  J]  m- . 
i=0 


(c)   If  r^=q'^q^Hg,  i  =0,1,  •  •  •  ,j/-p,  then  Tq,  li,---,r^_p  are  linearly 
independent  and  span  the  space  of  all  linearly  estimable 
functions  of  g,  the  vector  of  fixed  effects  in  model  (3.196). 


Proof: 


(a)   By  Lemma  3.8,  (3.205),  and  (3.207) 


,j/-p 


QiHH'q'i  =  qi(EA-)q'i 
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(      ^     ^jh^ 


jeUi 


wht 


Thus 


=  ^l^m-  '       i  =0»1»  •  •  •  ji^-P 


Ui  =  {j:0<  j<i/-p,    ^.CV-j},    i  =0,1,-  •  -,*/-? 


6l=   E  b.. 

JGU.    J 


rank(Q^H)  =  rank(Q^HH'Q'^) 


=  rank((5*I„.)  =m^,      i  =  0,1 ,  •  •  •  ,i/-i 


(b)      First   note   that 


Qf  HH'q!f  = 


Qz.- 


•'•I'-Pj* 


From   Lemma   3.8   we   have 


i/-p 


/^    H      X  Pilm..         i  =  i'  =  0,l^ 

Qi(    EA>'.,=      '      '  .,., 

^^  j=0  -^^  ^       [0,  1  7^  1  . 


,I/-p 
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Consequently, 


Q^HH'Q^^V^tl^., 


Thus 


rank(H)  >  rank(Q  .H)  =  rank(q  .HH'Q'.)  =  "£  m-  .  (3.253) 

i=0 


Conversely,  from  Lemma  3.8  and  the  fact  that  Q  is  orthogonal,  we 
have 


H  =  Q'QH  =  (  EQ'iQi)! 


i=0 


(  Eq-QiV  +  f     t     q-QiV 

^  i=0        ^       ^  i=i/-p+l        ^ 


J/-P 


(.i:/iqi>' 


Since 


j/-p  .  ,  j/-p  .      v-p 


(  EQ'iQiX  EQ'jQi)=EQ'iQi  +  E  q'iQiQjQi 

M=0        ^  j=0   -J    -^^     i=0  i  ^  j  -^    -^ 


1/-P 

-EQ'iQi 

i=0 


I/-P  I/-P 

we   see   that    Y,  Q'iQi    i^    idempotent  and  thus   rankl     Yl  Q' Q  ■  ) 
i=0  ^  i=0        ^ 


.  I/-P  u-p  J/-P 

=  tr(    EQiQih=  Etr(Q'.q.)  =  Emi.      Hence 
^  i=0        ^     i=0  i=0 


rank(H)  —  rank 
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(  i^o'^i^i)" 


.  I/-P  -      J/-P 

<rank(    EQiqi)=  Em:. 
^  i=0        ^     i=0 


(3.254) 


u-p 


Therefore,    (3.253)   and    (3.254)    imply   that  r(H)  =  E  ••>• 

i=0 


(c)   Recall  that  I:  =  Q':Q:Hg,  i  =  0, 1 ,  •  •  •  ji^-p.   Suppose  that  Tq, 

Ij>*"'>rt/-p  were  linearly  dependent,  then  there  would  exist 
constants  aQ  ,a*,  •  •  •  ,a|JJ_p,  not  all  zero,  such  that 


'lfatQiQiHg  =  0. 
i=0 


I/-P 


This  implies  that  EatQ';QiH  =  0.   Premultiplying  by  Qi(0<j<:/- 

i=0  -^ 

p)  results  in 


J/-P 


Clj(.EatQ'iQiH)  =  a;5qjH  =  0. 


Hence  we  conclude  that  a'^  =  0.   Consequently  a'!' =  0  for 
i  =  0,1 ,  •  •  ■  ,i/-p   which  is  a  contradiction.   Therefore 
lO'Ii '  ■  ■  ■ 'Iiz-p  must  be  linearly  independent. 


Now, 


Hg  =  Q'qHg  =  (  EQiQi)Hg 


i=0 


u-p  i/-p 

=  EQ'iQiHg=Eii 

i=0  i=0 
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Thus,  any  estimable  linear  function,  |'Hg,  of  g  can  be  written  as 

a  linear  combination  of  the  elements  of  Tq,t^,  •  •  •  ,T^^_■p   of  the 

i/-p 
form  ^  £'r-  .   Therefore,  rn,rj,  •  •  •  ,Tj^_p  span  the  space  of  all 

linear  estimable  functions  of  g.  D 


I 


From  Lemma  3.10  we  can  restrict  our  attention  to  developing 
methodology  concerning  IqjIi,  •  •  •  ,lj/_p.   Before  we  proceed,  let  us 
recall  that  fsee  (3.246)] 


u.  ~N(Q.Hg,  ^iIn,.+a,^qiKQ'.),  i  =  0, 1,  ■  •  •  ,i/-p 


u.^~HQ,6-^l^.    +<r'eQiW^^),    iQ  =  u-p^l,---,u. 


(3.255) 


In  (3.255)  notice  that 

QiHg  =  qiIi,  i=0,l,--.,i/-p.  (3.256) 

The  vector  of  "proper  error  terms".   If  the  design  was  balanced 
and  we  wished  to  test  a  testable  hypothesis  concerning  an  estimable 
linear  function  of  the  fixed  effects  in  the  model,  we  would  do  so  by 
scanning  the  "E(MS)"  column  of  the  corresponding  ANOVA  table  for  a 
mean  square  which  could  serve  as  an  "error"  term  (if  one  existed)  for 
the  test  statistic  used  in  testing  our  hypothesis.   This  mean  square 
would  be  chosen  in  such  a  way  that  under  the  null  hypothesis  our  test 
statistic  would  have  an  F-distribution.   If  no  such  mean  square 
existed  then,  typically,  we  would  utilize  Satterthwaite's  procedure 
to  develop  an  approximate  test. 
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When  the  design  is  unbalanced  there  generally  does  not  exist  a 
mean  square  that  can  serve  as  a  proper  error  term.   Consequently 
exact  tests  (based  on  the  ANOVA  sums  of  squares)  do  not  usually  exist 
for  the  above  mentioned  hypothesis  testing  problem.   The  most  common 
approximate  test  is  based  upon  Satterthwaite's  procedure. 
Unfortunately  the  sums  of  squares  involved  in  Satterthwaite's  quasi 
F-ratio  are  typically  neither  independent  nor  chi-squared 
distributed.   Either  of  these  "failures"  is  enough  to  invalidate 
Satterthwaite's  methodology.   This  has  led  some  authors  to  advocate 
the  use  of  traditional  (balanced)  methods  even  in  the  presence  of 
imbalance  (bHss  (1968)  or  Tietjen  (1977),  for  example). 

Ve  now  present  a  technique  that  will  allow  us  to  construct  exact 
tests  (when  such  exist)  for  estimable  linear  functions  of  the  fixed 
effects  in  model  (3.196).   This  will  be  done  by  using  resampling  to 
make  a  transformation  that  will  result  in  a  set  of  mutually 
independent,  (scaled)  chi-squared  distributed  sums  of  squares.   The 
idea  will  be  illustrated  by  considering  a  three-way  cross- 
classification  design  in  which  factors  A  and  B  are  fixed  while  factor 
C  is  random. 

Let  the  random  vector  v  be  defined  as 


^f 


(3.257) 


where  v^  and  v^,  are  determined  as  foil 


ows: 


(i)   Put  u.  (i  =0,1,  •  •  •  ,t/-p)  in  v^  if  and  only  if  there  exists 
an  iQ  e  {iz-p+l  ,-■■  ,u}   for  which  6-  =6-    . 

1     1a 
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(ii)  Put  the  corresponding  u^  (^q  =  i^-p+l,  ■  ■  •  ,1^)    in  y^,. 


Thus,  v^  consists  of  those  vectors  for  which  a  proper  error  term 
exists  and  v^.  consists  of  the  corresponding  vectors  used  in  those 
proper  error  terms.   Recall  that  11^  =  ^1^^  ^^lere  Qj^  is  defined  in 
(3.214),  k  =  0,l,  •  •  •  ,j/.   Also  6^   (k  =  0, 1 ,  •  •  •  , j/)  is  defined  in  (3.217). 
It  is  also  convenient  to  introduce  the  following  notation: 

:ff  =  {0  <  i  <j/-p:  u^    is  part  of  v^}  (3.258) 


;fj- =  {j^-p+1  <  Iq  <i/:  u.   is  part  of  v^}  (3.259) 


i'  =  a"^  u  f^ . 


We  note  that  f^  and  :f^   (and  hence  If)  may  be  empty.   In  such  cases  we 
cannot  apply  our  procedure. 

Example  3.2 

Consider  a  three-way  cross-classification  design  in  which 
factors  A  and  B  are  fixed  while  factor  C  is  random.   The  model  can  be 
expressed  as 

yijk£  =  ^+"i+/^j  +  («/?)ij+7k  +  (aT)i}.  +  (/?7)jk  +  («/37)ijk  +  fijkg   (3.260) 
(i=l,2,...,a;  j  =  l,2,-..,b;  k  =  l,2,.-.,c;  £  =  1 ,2,  •  •  •  ,n  •  ^j^)  . 
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Following  the  notation  in  Section  3.3.2:  u  =  7,    p  =  4,  and  7,  ,  (07),  , 
(/^T)jk'  ("/^T)ijk,  and  f^jj^^  are  independent,  normally  distributed 
random  variables  with  zero  means  and  variances  (T^,  a^,  a^,  ^f,  and  o-^, 
respectively.   Table  3.3  summarizes  the  pertinent  information  needed 
in  the  determination  of  v. 

From  Table  3.3  we  see  that 

(3.261) 
Yr  =  [u4:H5--U6:u7]'- 

Thus,  in  this  example,  the  vector  v  is  exactly  the  vector  u.   It  is 
interesting  to  note  that  if  we  had  instead  assumed  factor  A  fixed  and 
factors  B  and  C  random  then  v  would  not  consist  of  any  elements  and 
our  procedure  could  not  be  applied.   That  is,  no  exact  tests  (based 
on  our  methodology)  could  be  constructed  for  any  estimable  linear 
function  of  the  fixed  effects.   We  would  thus  need  to  apply 
Satterthwaite's  procedure.   More  is  said  about  this  in  the  sequel. 

Utilizing  resampling.   Recall  (3.225)  and  (3.226).   Instead  of 
the  partitioning  of  R  as  described  in  (3.230)-(3.232) ,  we  now  express 
R  as 

R  =  C^AfC^  (3.262) 

where  C^  is  an  orthogonal  matrix  and  A^  is  a  diagonal  matrix 
consisting  of  n-c  ones  and  c  zeros.   We  can  partition  C^  and  A^  as 


-129- 


Table  3.3     Information  needed  to  determine  the  vector  v  in  (3.257) 
for  a  three-way  cross-classification  model  in  which 
Factors  A  and  B  are  fixed  while  Factor  C  is  random 


^i 


where 


^i  W.(see    (3.217))  S^(see   (3.217)) 


0 

4 

abc 

{4,5,6,7} 

abo-^ 

+  h<rl  +  a.al  +  aj 

1 

{i} 

be 

{5,7} 

haj  +  (7-2 

2 

{J} 

ac 

{6,7} 

aal  +  aj 

3 

{i,j} 

c 
ab 

{7} 

-? 

4 

{k} 

{4,5,6,7} 

abo-^ 

+  b<r|  +  a^l  +  a} 

5 

{i,k} 

b 

{5,7} 

K  +  ^? 

6 

{j,k} 

a 

{6,7} 

a-crl  +  0-^ 

7 

{i,j,k} 

1 

{7} 

-? 

Cf-[Cfj:Cf2  =  C-f3] 


Af  =  diag(I^^^,I^^^,0), 


1  e:f*^ 


r?^2  =  n-2c-f     E     m. 

i  ea"^ 


(3.263) 


(3.264) 


and  j"^  is  the  complement  of  f  with  respect  to  {0,1 ,  •  •  •  ,i/} .   If  j"^  =  0 

then  7?^j  =  c  and  v^2  =  n-2c.   Note  that  t?^^  +  t?^^  =  "- ^  =  rank (R)  .   Also, 
by  (3.195),  »7f2>0. 
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As  V  (in  (3.257)1  consists  of  those  u-  for  which  iGif,  let  Qy 
consist  of  all  those  Q-  for  which  iG^.   In  Example  3.2 
Qy  =  Q  =  [QqiQj  :  •  •  • :Q^]'.   If  v  is  void  of  elements  (or,  equivalently 
i' =  (^)  then  Qy  will  not  contain  any  elements  and,  as  mentioned 
previously,  our  procedure  cannot  be  applied. 

Now,  define  the  random  vector  Ur   as 

yf  =  ^+(^f,maxVl-^f^'^^i^  (^-^^^^ 

where  A^     is  the  largest  eigenvalue  of  G^^  and  G^  is  the  matrix 

Gf  =  QvKqv-  (3.266) 

1 
The  matrix  (Af  ^^^^I;,^^  -  G^)2  is  well-defined  since  i^f  ^^^^^rj^^-^f)    is 

p. s .d. 

Before  we  proceed  we  need  the  following  notation:   Suppose 

S  =  {pq,Pj  ,  •  •  •  ,/j-j.}  where  each  p-    is  an  integer  satisfying 

Pq  <  Pj  <  •  •  •  <  p^  and  t  +  1  is  the  number  of  elements  in  S.   Then 

w=(wj:jeS)^(w^^:w;,^:...:wy. 

Now,  partition  y  as 

y=(v.:iGa'),  (3.267) 

where  each  y^  is  of  dimension  m-,  ieS'.   Ve  note  that  {y:iGi'} 
represents  a  subset  of  {uq,Uj  ,  •  •  •  ,u^} .   Partition  w^  just  like  y. 
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That  is 

y^=(w^.:iGf)  (3.268) 

where  u^-    is  of  order  m ■  x  1 ,  i€^.   The  key  result  of  this  section  is 
given  in  the  next  theorem  which  provides  some  distributional 
properties  for  {wo.:iei'}  and  justifies  the  transformation  made  in 
(3.265). 

Theorem  3.8 


where  r-  (i  =  0, 1 ,  ■  •  •  ,t/-p)  is  defined  in  Lemma  3.10(c). 

(b)   {yfj:i€;f}  consist  of  independently  distributed,  normal  random 
vectors  with  ur-    having  variance-covariance  matrix 

Var(wfi)  =  (5.+Af^^^^a2)I„.,  igf.         (3.270) 

Proof : 

(a)   From  Lemma  3.9(c),  RX-=0,  i  =0 ,1 ,  •  •  •  ,u .      Therefore,  since 
range ( Cj^  )  C  range (R)  and  R  is  symmetric,  it  follows  that 
CVX-=0,  \=0,l,---,u.      Consequently 

E(C'f^y)=C'f^Xg  =  0. 
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(recall  that  X =  [XqiXj  :  •  •  • :X^  ]) .   Since 


E(u.)=(ji^i'      i=0,l,...,.-p 


(see   (3.255)),  it  must  be  that 


E(v.)=   ^-i'     '^'^ 


Thus  the  result  follows  after  recalling  the  definition  of  y. 
given  in  (3.265). 

(b)   Clearly  y^.  is  normally  distributed  for  each  iG^f.   Now  recall 
that  u.  =Q.y,  i  =0,1,.  •.,!/,  (see  (3.243)).   Since  v  consist  of 
exactly  those  u-  for  which  i  e  j-  and  %   consists  of  the 
corresponding  Q^ ,  it  follows  that 

>^  =  Qvy-  (3.271) 

Since  y  is  independent  of  SSE,  DnR  =  0.   Since  range(Cfj) 

Crange(R),  DnC^^^O.  Thus  y  is  independent  of  C^^y. 

Consequently  v  is  also  independent  of  C^^y  as  it  is  a  function  of 
y.   Therefore 

Var(.,)=Var(y)  +  (A,^^^^I^^^-G,)^^fiC,^(A,^^^^I^^^-Gf)i 
By  the  now  familiar  argument,  it  is  easy  to  show  that 
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C'f  jfiCf  J  -  '^fl^^j  • 


Also,  from  (3.217)  and  (3.218), 

Var(v)=  .®J^l^.+alG.. 
Hence,  after  some  simplification,  we  obtain 

V-(yf)  =  .|,(^i+-?A^,,ax)S- 
Thus 

Var(y^.)  =  (5.+.|A^^^^^)I„.,  ief. 
From  Theorem  3.8,  we  can  conclude: 

yfi~N(Q.r.,  -efilm.)  for  i  e  If ^ 


and 


Vf.  ~N(0,  ^f  I^.  )  for  iQE^r 
^0         ^0   ^0 


where 


(3.272) 


^fk  =  '^k  +  Mf,max'  ke^.  (3.273) 

The  exact  procedures.   First  recall,  from  Lemma  3.10(c),  that 
ri=Q'jqjHg,  i  =0,1,  •  •  •  ,j/-p.   Consequently, 
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^'i^iTi=Ti,      i  =0,1,.  •  .,i/-p.  (3.274) 

Thus,  constructing  exact  simultaneous  confidence  intervals  for  all 
linear  combinations,  dV •  ,  of  r.  (d€R^)  is  equivalent  to  constructing 
exact  simultaneous  confidence  intervals  for  all  r'Q.r-,  where  r'=  d'Q' 
and  hence  is  in  the  row  space  of  Q'  ,  i  =  0, 1 ,  •  •  •  ,j/-p. 
From  (3.272)  we  can  conclude  that 


(i)  (yfi-Qiri)'(yfi-Qiri)/efi~xL,  ie^ 


f 


(3.275) 


(ii)  yfin^fiy^fi^-X^,  ,  ioe^r- 


'O^^O  ^^0   '"if 


By  our  construction  of  v  in  (3.257),  for  each  i  G  :ff  there  exists  an 
iQE^r    such  that  ^i^  =  ^i  (implying  ^fi^  =  ^fi).   Therefore  if  i  G  :ff  and 
iQG:fr  are  such  that  ^f;=^^-    then,  for  MS.-  =w'  .  w..  /m.    , 

•f^SfiJ^^fi^^^fi  (3.276) 


and 


(yfi-Qiri)'(yfi-Qiri)/mi 

W7, S'">i  •        (2-27^) 


Hence,  simultaneous  confidence  intervals  on  all  linear  functions, 
r'q^r.,  of  Q^r.  (r  G  range(Q.  )j  can  be  obtained  from  the  formula 
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r'Qiri€r'y^i±(miMSf.  F^^      r'r) 


1-a  (3.278) 


(iGS'f,  ioG^r  such  that  ^fi  =^fi).   We  note  that  r'w^.  =  d'Q'.y^.  and 
thus  since  Q'^Q.  is  idempotent,  d'Q'-w^.  is  the  least  squares  estimator 
of  dVj  iGlf^  (see  (3.272)).   Formula  (3.278)  is  based  on  Scheffe's  S- 
method  (sheffe  (1959,  pp.  68-72)). 

To  test  HqiLV.  =1  for  i£^^,    where  L'  has  full  row  rank  r^,    we 
proceed  as  follows:   Recall  (3.272),  let  Ij  =  Q-y^ ^  be  the  least 
squares  estimator  of  r-,  iGJ'r.   then 

L'fi-|  =  L'Q'iyfi-|~N(L'ri-«,  ^f.L'Q'iQiL)        (3.279) 

{ief^).      Because  we  are  assuming  that  the  above  hypothesis  is 
testable  for  all  iG^f^,  L'r-  consists  of  a  set  of  linearly  estimable 
functions  of  r  •  ,  i  G  :f  ^ .   Thus,  there  exists  a  full  row  rank  matrix  M 
(rank(M)=rL)  ^^^^  *^^* 


L'  =  M'q'iQi,   ie=^f  (3.280) 

Then,  taking  r^  <  m- ,  iG^_f,  it  is  easily  shown  that  M'Q'j  has  full  row 
rank  r^,  ief^.      (rL  =  rank(L')  <  rank(M'Q'.  )  ,  i  G  ^^ ;  but  M'Q'i  is  rLXm- 
and  rL<m.  and  so  rank(M'Q'i )  <  r^,  ^  ^  ^f)'   Furthermore 


L'Q'iQiL  =  M'q'.q.M,   iGlff,  (3.281) 

consequently,  for  i  e"}^ 


-136- 
rank(L'Q'iqiL)=rank(M'Q'i)=rL.  (3.282) 

Thus,  L'q'^Q^L  is  nonsingular  for  i€^^.      Therefore,  if  we  define 

SSHi  =  (L'Q'.y^ .  -«)'(L'Q'iQiL)-i(L'Q'.yf .  -|)        (3.283) 
(ieif^),  we  obtain 

SSH./^^.  -Xr^C^i)'   ie:ff  (3.284) 

where 

2.^^ .  A.  =  (L'li  -|)(L'Q'iQiL)-i(L'r.  -|)  ,   i  e  Iff .       (3.285) 

Now  if  ipGf^  is  such  that  ^f.  =  ^^ .  ,  iga-^,  then 

SSH./rr 
\=~MS7 FrL,m.  (Ai),  (3.286) 

where,  recall,  MS^ i^  =  ^f i^^f i^/m-  .   Under  Hq,  A •  =  0  and 
^H.~^rr,m.  •   Thus,  Fj,   can  be  used  to  test  Hn:L'r.=£,  iq^..      We 
would  reject  Hq  for  large  values  of  the  test  statistic. 

An  approximate  test.   When  there  does  not  exist  a  proper  mean 
square,  MS^-^,  with  the  property  that  E[MSf.  ]  =  .^f  i  (  i  =  0, 1 ,  •  •  •  ,i.-p; 
^-P+1  <  io<i')>  then  we  cannot  apply  the  above  methodology  to  construct 
a  test  for  UQ-.L'r^^i,    i  =  0, 1 ,  •  •  •  ,  t/-p.   One  common  approximate 
procedure  is  to  develop  a  "synthetic"  mean  square  that  represents  a 
linear  combination  of  the  mean  squares  associated  with  the  random 
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portion  of  model  (3.243)  such  that  its  expected  value  is  ^^^ 
(i  =  0, 1 ,  •  •  •  ji'-p)  .   This  is  Satterthwaite's  procedure.   Unfortunately, 
the  numerator  and  denominator  of  the  resulting  quasi  F-ratio  will  not 
be  independent  as  required  fsee(3. 250)1.   To  overcome  this  difficulty 
we  first  make  a  transformation  (utilizing  resampling)  that  will 
result  in  a  set  of  mutually  independent,  (scaled)  chi-squared 
distributed  sums  of  squares.   Then  (rightly)  apply  Satterthwaite's 
procedure  to  construct  the  approximate  test.   We  will  illustrate  the 
idea  by  considering  a  three-way  cross-classification  design  in  which 
factor  A  is  fixed  while  factors  B  and  C  are  random. 
To  begin  with  recall  from  (3.238)  that 

Let 

^f  =  {0  <  i  <  J^-p:  u^is  part  of  v^}  ,  (3.287) 

fsee  (3.257)  and  (3.259)).   Where  the  vector  v^  is  defined  as  follows: 

(i)   Put  u- (i  = 0, 1 , •  •  •  ,f-p)  in  yS  if  and  only  if  there  does  not  exist 
an  IqS  {i/-p+l,  •  ■  ■  ,1/}   for  which  6-  =6-      (iJk,  k  =  0, 1 ,  •  •  •  ,j/,  is 
defined  in  (3.217)V 

Thus,  f^  identifies  the  subscripts  of  the  u-  (i  =  0,1 ,  •  •  •  ,i^-p)  that 
are  in  v^.  Note  that  every  u^  (i  =  0, 1 ,  •  •  •  ,J/-p)  is  in  either  v^  or 
vS.   Define  x  as 
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x=-^.  (3.288) 


In  (3.288),  v9  is  not  independent  of  Uj,  fsee  (3.250)j. 

Example  3.3 

Consider  a  three-way  cross-classification  design  in  which  factor 
A  is  fixed  while  factors  B  and  C  are  random.   The  model  can  be 
expressed  as  in  (3.260).   In  the  notation  of  Section  3.2.2  we  have 

9     0         2       1   2 

1/ =  7  and  p  =  6.   The  variance  components  are  (r2»  ""a*  '  '  '  s'''?)  ^""  ^e- 
Table  3.4  summarizes  the  pertinent  information  needed  in  the 
determination  of  x. 

From  Table  3.4  we  see  that 

Yf  =  [u():ul]  =  Uf,  (3.289) 

where  u^  was  defined  in  (3.245).   Thus,  in  this  example,  the  vector  x 
is  exactly  the  vector  u  (see  (3.243)1. 
Now,  let  T  be  the  set 

g'  =  :fS  U  {i/-p+l,  •  •  ■  ,u}.  (3.290) 


Also,  define 


Vai  =  c  -  E  "i 

ieg-^ 


'/a2  =  n-2c+  E  m. 

ieg-^ 


(3.291) 
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Table  3.4    Information  needed  to  determine  the  vector  x  in  (3.288) 
for  a  three-way  cross-classification  model  in  which 
Factor  A  is  fixed  while  Factors  B  and  C  are  random 


i 

^i 

h 

V.(see    (3, 
{2,3,4,5,6,7} 

.217))              5.(see    (3.217)) 

0 

fli 

abc 

aco-^  -f  ccrl  +  abcr^  +  hcrj  +  a(Tg  +  aj 

1 

{i} 

be 

{3,5,7} 

C(t\  +  \ia\  +  (t\ 

2 

{J} 

ac 

{2,3,6,7} 

acal  +  cal  +  acTg  +  a^ 

3 

{i,j} 

c 

{3,7} 

cal  +  aj 

4 

{k} 

ab 

{4,5,6,7} 

ahaj  +  haj  +  aal  +  aj 

5 

{i,k} 

b 

{5,7} 

haj  +  aj 

6 

{j,k} 

a 

{6,7} 

a4  +  (T? 

7 

{i,j,k} 

1 

{7} 

4 

where  g"^  is  the  complement  of  g"  with  respect  to  {0, 1 ,  •  •  •  ,  j/} .   Note 
that  r;ai  +  r7a2  =  n-c  =  rank(R).   Also,  t]^^>0   by  (3.195).   The  matrix  R 
IS  defined  in  (3.226).   We  now  introduce  a  transformation  that  will 
result  in  a  set  of  independent,  (scaled)  chi-squared  distributed  sums 
of  squares.   The  development  of  this  transformation  is  similar  to  the 
development  that  led  up  to  (3.265)  and  is  thus  only  outlined. 


Define  the  random  vector  u^    as 


1 

ya  =  x+(Aa^„^^I^^^-Ga)2C^iy  (3.292) 
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where  G^  =  Qx'^Qx  ^"*^  '^x  consists  of  those  Q^  for  which  iG^.   (In 
Example  3.3,  Qx  =  Q.)   -^ajinax  ^^  *^^  largest  eigenvalue  of  G^;  and  C^^ 
as  an  n  x  r/g^j  matrix  constructed  similarly  to  C^j  in  (3.263),  for 
example. 

Partition  x  as 

x  =  (x.:iGg-),  (3.293) 

where  each  x-  is  of  dimension  m-,  i€^.   We  note  that  {x^ :  i  G '5'} 
represents  a  subset  of  {uq,u^  ,  •  •  •  ,u^}  .   Partition  Ug^   just  like  x. 

That  is 

ya=(yai  =  i^"^)'  (3.294) 

where  w,  •  is  of  order  m-xl,  IgT.   Ve  then  have 
Theorem  3.9 


fq-r.,    iG^fS 


(3.295) 


(b)  {<jj    •  :  i  G 'iT}  consists  of  independently  distributed  normal  random 
vectors  with  w  •  having  variance-covariance  matrix 

Var(w^.)  =  (6.+Aa,max'^^)Im.»  i^^- 
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Proof : 

The  proof  of  Theorem  3.9  is  similar  to  that  of  Theorem  3.8  and  is 
thus  omitted.  D 

In  order  to  construct  an  approximate  test  of  the  testable 
hypothesis  Hq:L't.  =|  vs.  Ug^-.L'T-^^l.,    ief^,    we  apply  Satterthwaite' s 
procedure  (in  the  usual  manner)  to  the  set  of  vectors  {w^^ : i £  ^} . 

3.3.5   Reduction  of  the  Tests  When  the  Design  is  Balanced 

If  the  design  is  balanced,  that  is,  if  n^  =  N  for  all  reT  (see 
(3.190)  and  (3.191)],  then  the  cxc  matrix  K  given  in  (3.211)  becomes 


K-N^c- 


Consequently,  the  matrices  Gj.,  Go,  and  Gg^  become,  respectively,  j^I^  , 
il^  ,  and  il^  .   (See  (3.223),  (3.266),  and  (3.292).)  Hence,  each 
matrix  has  i  as  its  largest  eigenvalue.   Thus,  when  the  design  is 
balanced,  Wj,,  w^,  and  w^^  reduce  to  u^,,  v,  and  x,  respectively,   fsee 
(3.237),  (3.265),  and  (3.292),  respectively.)  Recall  that  u^.  =  Qj,y 
(see  (3.220)  and  (3.221)).   Similarly,  Y  =  Q^y  (see  (3.271))  and  x  =  Qj^y 
(see  (3.288)  and  the  discussion  after  (3.292)). 

Now,  from  (3.243),  and  (3.252),  and  the  fact  that  the  matrix  Q 
in  (3.214)  is  orthogonal,  we  have 

u'u  =  y'Q'Qy 
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=r  ( A^iOi)? 


i=0 


=  Ey'Piy- 

i=0~ 


(3.296) 


From  Khuri's  (1990)  Lemma  3.2  and  (3.208)  we  have 


y'PiY  =  y' 


=  y 


j=0  ^J  J  J 


|o^''j"'j'(Si"j') 


(3.297) 


(i  =0,1,  •  •  •  ,1/).  A.  ( j  =0,1,  •  •  •  ,1^)  was  defined  in  (3.207).  Here  A^  • 
is  a  known  constant  whose  possible  values  are  -1,  0,  and  1  (see  Khuri 
(1982)  for  a  more  detailed  definition  of  the  A-  •).  Since,  the  design 
is  balanced 

y  =  ^(ic®lN)y- 

Therefore  (3.297)  can  also  be  expressed  as 


rPiy  =  ^y'(ic^iN(4^'ij/'j)(SiS«) 


(ic®iN)y 


|o^'ij/'jH(;!^j>JN} 


=  V 


.5o^'ij/''jH(g^J^)^^N} 


(3.298) 
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(i  =  0,1,  •  •  •  ,1/).   Thus,  when  the  design  is  balanced,  Ny'Pny, 
Ny'Piy,  •  •  •  ,Ny'Pj,y  reduce  to  the  usual  balanced  ANOVA  sums  of  squares, 

Recalling  the  expressions  for  u^,,  v,  and  x  given  above  we  see 
that  when  the  design  is  balanced 


y'r^r^HrUr^  E   y'PiY, 
i=/^-p+l 


Consequently,  since  {j/-p+l ,  .  .  .  , j,} ,  y,  and  g"  are  all  subsets  of 
{0,1,  ■  ■  ■  ,1/},    the  exact  tests  developed  for  the  random  and  fixed 
effects  reduce  to  the  usual  ANOVA  tests  associated  with  a  balanced 
mixed  model  of  the  form  given  in  (3.189).   The  approximate  test 
reduces  to  the  approximate  test  (based  on  Satterthwaite' s  procedure) 
that  would  have  been  obtained  if  the  design  had  been  balanced  to 
begin  with. 

3-3.6   The  Power  of  the  Exact  Tests 

In  this  section  we  examine  the  power  of  the  exact  tests  (when 
such  exist).   To  begin  with  we  consider  the  power  of  the  exact  test 
associated  with  the  i^^  variance  component  (  i  =  j/-p+l ,  ■  •  •  ,i/) . 

The  power  of  the  exact  test  associated  with  the  hypothesis 
Hq:o-^  =0,  i  =t/-p+l,  ■  ■  ■  ,u.      For  sake  of  convenience,  let 
^^r,i/+l=SSEr2,  Vl^^'  ^"^  '?r2  =  m^+i/V.max  (see  (3.232)  and 


■144- 


(3.235)j.      We  wish   to   test  Eq:<t\=0  vs.    Ha:<T|7^0,    i  =  i/-p+l ,  •  •  •  ,i/. 
Suppose   that   there   exists  a   j(  7^  i)  G  {j/-p+l ,  •  •  •  ,t/+l}    such   that   under 


ss 


ri 


-^j+^r,maix'^c  _  E 


SS, 


rj 


as  in  (3.242).   Then  the  appropriate  test  statistic  is  given  by 


_SS^i/m. 


Hn  F„ 


'1   SS^j/mj  "0  •m.,mj 


Under  E^: cr'^^  :^  0 ,    we  have 


SS^./m.  keW: 


E  t>,a?+A,     -2 


'k^k  +  '^r,max<^e 


E  Ml  +  A,     -2 


kev. 


'k'^k"^'^r,max<^e 


ss,yn.j 


mi,my 


This  follows  from  (3.217)  and  Theorem  3.7.   Hence  if  *  .  represents 
the  power  of  the  test  for  a?,  i  =  i/-p+l ,  •  •  •  ,j/,  tht 


len 


*  .  =  PfF  .  >  F       IH  "I 
ri    ^    ri-    oi,m^,m-\'^a.) 


-PFm.,m.>..u^   F. 


^i'^j-l+b.^^.  'a,m.,m. 


(3.299) 


wh 


ere 


ri 


,  5,  ''k'^k  +  '^r,max'^€ 

K  e  w  • 


2' 


(3.300) 


(i -1/-P+I,  •  ■  .,//;  j(  7^  i)  e{z/-p+l,  .  .  .,t/+l}  such  that  (5.  =5.).   Sin 


ce 


'^. 


I  increases  monotonically  with 


'ri'  1  =''-P+l,  •  •  •  ji/,  it  follows  that 
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it  is  a  monotone  decreasing  function  of  Aj,  ,^3^^^  ffor  fixed  ratios, 
cP'- 1 o\   (  i  =  f-p+l ,  •  •  •  ,1^)  ,  of  the  variance  components).   It  is  thus 


desirable  for  A^,  ^^^  to  be  small, 


The  power  of  the  exact  test  associated  with  the  hypothesis 
HnrL'r.  =1,  i€^-f.   To  test  HQ:L'r^=£,  iG^f^,  the  appropriate  test 
statistic  was  (see  (3.286)1 

SSHj/rL 
^Hi=  MS..   '  '^^f 

Recall  that  rL  =  rank(L).   Also,  SSH^  (ie^f)  was  defined  in  (3.283). 
Furthermore,  Iq  G  i'^.  is  such  that  E[MSfj^  ]  =  ^f  i  (^ee  (3.259),  (3.273), 
and  (3.276)].   Now,  under  Ha,:L'r^7^£,  Fy^  has  a  noncentral  F- 
distribution  with  noncentrality  parameter  A^(i6^^)  (see  (3.285)j. 
Thus,  if  '^^-    is  the  power  associated  with  this  test,  then 


*fi=P(FHi>Pa,rL,m.  |Ha) 


=  P(Frj,m.  (Ai)>F«,      ),  iGlf^.  (3.301) 

Now  for  iGi^,  '^f-  monotonically  increases  in  A^  fsee  Graybill 

(1976),  p.  130).   However  A-  monotonically  decreases  in  ^^^  frecall 

(3.285)].   Since  ^fi='^i+Af  max'^f'  ^i  "'onoto°ically  decreases  in 

Ao     ,  iG^j:.   Therefore  it  is  desirable  to  have  A^^  ^^„  small. 
1 , max '1  1 , max 

The  design  parameters  A^,  ^^^^  and  A^  max"   ^"^^  following  lemma 


provides  upper  and  lower  bounds  for  Aj,  ^^y^   and  A^ 


max 
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Lemma  3.11 


Let  n^^^=min  n_  and  recall  the  definitions  of  77^,^  and  T)_r     in  (3.232) 
r  eT 

and  (3.264),  respectively.   Then, 


(a)   rr- 


c-1  V  J *^~°ri-l 

^-"r      (1) 


rGT 


n 


-i  '^r ,niax  -  (jV 


<^)  4r 


rW^~    n(0 


-  ^f , max  -  C 1 V 

n^  ^ 


Proof : 


(a)   Recall  that  e^jaxC^)  denotes  the  largest  eigenvalue  of  the  matrix 
A.   Then,  from  (3.223), 


•^r,max  —  ^max("r)  —  ^max(Qr'^'»r) 


<emax(K)eroax(Mr) 


=  emax(K)en,ax(Qrqr) 


=  ^maxC^)     (^r  ^^^  orthonormal  rows) 


.(0 


It  is  also  true  that 


emax(QrKQi-)>ijiYtr(qrKQi-). 
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The  term  on  the  right  being  the  average  of  the  eigenvalues  of 
QpKQp.   However, 

tr(QrKQ'r)  =  tr(KQi,Qr-) 

=  tr(K(Ic-Qfqf))        (see  (3.244)). 

From  Lemma  3.8(a)  and  for  Q^  =  [Q'^:  •  •  •  :Qj^_p]'  we  have 


QfQf  =  [:^ic:q^'] 


=  Jc  +  Qfqf. 


Hence 


tr(K(Ic-Q'fQf))  =  tr(K(Ic-Jc)-KQ^'Qf) 


=  tr(K(Ie-Jc))-tr(Kq|'Qp, 


Now, 


tr(K(Ic-Jc))=tr(K)-tr(KJJ 


=  tr(K)-itr(Klcl'c) 


=  tr(K)-il'cKlc 


_c-l   V  X 
~  c    Z^  n_- 
r  GT  ^ 
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Also,  tr(Kq^'Qp-tr(q^KQ^')-   But,  since  Q|q|' =  I^_^^^_i ,  it 


follows  that  — 3-^Q^q^'-q^KQ^'  is  p.s.d.   Consequently, 


i*T/n*'\  ^      1 


tr(q|Kq^')<-^(c-nri-l)- 


Thus 


c-1  V-  1   c-nj^-l 


tr(qrKq'r)>^  E  rr^ M- 


and 


•^r,max  —  x]-^-^ 


c-1  V  J: ^  ~  '>rl  -  1 

r  e  1       n^  ' 


iV)      Xj-  is  the  largest  eigenvalue  of  G^  defined  in  (3.266).   Thus 

^    ^        I ,  max  o      c3  1 


-^f  ,max  =  ^max(Gf )  =  emaxC^v^Qv) 


<emax(*^)«max(QOQv) 


,(0 


(q^  has  orthonormal  rows) 


On  the  other  hand,  it  is  true  that  A^  ^^^   is  greater  than  or 
equal  to  the  average  of  the  eigenvalues  of  q^Kq^.   That  is, 


emax(QvKQ;,)>47^r(q^Kq;,) 


-149- 
Now, 

tr(Q^KQO)=tr(KQ'vQv) 

=  tr(K(I-Q$'Q5)) 

where  Q^  consists  of  all  those  matrices,  Q^,  for  which  i  G  :f  . 

fVe  note  that  if  Qv  =  Q  then  trCQ^Kp'^)  =  tr(K)  =  J2    ifc-   And  thus 
^  tGT  '^ 

emax(qvKQv)>^  E  rrz-   Also,  in  this  case  r?^^  =  c  (see 
(3.264))).   Now, 

tr(K(I  -  Q$'Q$))  =  tr(K)  -  tr(Q5KQ5') 


=  E  ^-tr(q$KQ5'). 
rGT  ^ 


However,   -^QyQv' "  Qv^Qv'   is   p.s.d.      Consequently, 


■>CvnC/\  ^     1     ^-„/-nCnC/> 


tr(Q5KQ5')<-7p:tr(QX') 
n^    ' 

=  ;;fe(c-'^fi) 


Thus, 


rGT  ^      n"^^^ 
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Hence 


^f  ,max~  max 


emax(qvKQ;,)>^ 


rGT  ^ 


''fl 


.(0 


Some  remarks  are  in  order: 

(i)   In  an  unbalanced  random  effects  model  where  the  imbalance 

affects  the  last  stage  only,  such  as  described  in  Khuri  (1990), 
c  —  T/pj  =  1 .   Thus  the  lower  bound  given  for  Aj,  ^g^,^  in  Lemma 
3.11(a)  reduces  to  the  lower  bound  reported  in  Khuri 's  (1990) 
Inequality  (4.1). 

(ii)  If  Qq  is  not  one  of  the  matrices  in  the  definition  of  Q^  fsee 

(3.271)),  then  similar  to  part  (a)  of  this  lemma,  the  lower  bound 
for  Xn  can  be  improved  upon.   The  improved  lower  bound  can  be 

shown  to  equal 


c-1 


2-1    n^ 
rGT  ^ 


JO 


This  follows  similarly  to  the  proof  of  Lemma  3.11(a) 


(iii)  Recall  that  A^,  ^^^^   is  the  largest  eigenvalue  of  Qj-KQ!^  while 


f  .max 


is  the  largest  eigenvalue  of  Q^KQy.   Similar  to  the 


discussion  concerning  X^^^^   presented  after  Lemma  3.7,  Aj,  j^g^j^  will 

be  invariant  to  the  choice  of  Q„  and  \f   „„^  will  be  invariant  to 

*■       T ,  max 

the  choice  of  Qy. 
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3.3.7   An  Alternative  Approach  to  the  Analysis  of  the  Unbalanced 
Mixed  Model  with  Imbalance  Affecting  the  Last  Stage  Only 


Introduction.   In  this  section  we  reanalyze  model  (3.189). 
Instead  of  making  transformations  on  only  portions  of  u  (as  in 
(3.237),  (3.265),  and  (3.292)),  we  now  transform  all  of  u.   Ve  will 
again  utilize  resampling  in  this  transformation.   The  result  will  be 
a  set  of  sums  of  squares  which  are  independent  and  distributed  as 
scaled  chi-squared  variates  analogous  to  those  in  a  balanced  mixed 
model . 

This  approach  has  advantages  and  disadvantages.   One  advantage 
is  that  only  a  single  transformation  is  needed  to  reduce  the  analysis 
to  that  of  a  balanced  mixed  model.    Thus  exact  tests  (when  they 
exist)  for  the  variance  components  and  for  estimable  linear  functions 
of  the  fixed  effects  are  easily  constructed  (as  they  are  in  the 
balanced  case).   Also,  when  no  exact  tests  exist,  Satterthwaite' s 
procedure  is  readily  applied  in  the  usual  way. 

A  disadvantage  to  this  approach  is  that  the  power  of  the 

resulting  exact  tests  cannot  be  any  greater  than  the  power  of  the 

exact  tests  derived  in  the  previous  sections.   In  fact,  we  will  show 

that  the  power  of  the  exact  tests  resulting  from  this  new  approach  is 

equivalent  to  the  power  of  the  tests  we  would  have  obtained  if  we  had 

just  simply  deleted  observations  until  each  cell  contained 

n^  '^  =  min  n,.  observations. 
rGT  ^ 

This  disadvantage  is  the  principal  reason  why  we  recommend 
adopting  this  approach  only  when  a  few  of  the  cell  frequencies  are 
large  relative  to  n^  '  .      In  situations  where  only  a  few  of  the  cells 
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contain  n*^   observations  and  n^^^  is  small  relative  to  the  other  cell 
counts,  we  recommend  the  use  of  the  procedures  developed  in  the 
previous  sections. 

One  final  remark:   Unfortunately  some  notations  will  need  to  be 
reused.   However,  this  should  pose  no  difficulty  since  it  will  be 
clear  from  the  context  when  a  symbol  is  uniquely  defined  for  this 
section  or  when  it  has  taken  on  the  same  meaning  it  had  in  some 
previous  section. 

Utilizing  resampling.   Recall  the  definition  of  u  given  in 
(3.243),  namely  u  =  Qy  (q  is  defined  in  (3.214)).   From  (3.246)  we  have 


u~N 


QfHg 
0 


1=0    1     / 


where  Q^  is  given  in  (3.244)  and  G  is  defined  in  (3.247).   The  8- 
(i  =0,1,  •  •  •  ,j/)  are  defined  in  (3.217). 

The  matrix  R  (see  (3.226))  can  be  written 


as 


R  =  CAC'  (3.302) 

where  C  is  an  orthogonal  matrix  and  A  is  a  diagonal  matrix  whose 
diagonal  entries  consist  of  n-c  ones  and  c  zeros.   We  can  partition  C 
and  A  as 

C  =  [Ci:C2:C3] 

(3.303) 
A=:diag(Ic,Ij^_2^,0) 
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where  C^jCj,  and  C3  are  matrices  of  orders  nxc,  nx(n-2c),  and  nxc, 
respectively.   Ve  must  necessarily  assume 

n>2c.  (3.304) 

This  assumption  is  more  stringent  than  the  assumption  in  (3.195)  in 
the  sense  that  if  (3.304)  holds  then  (3.195)  will  hold.   But  the 
opposite  may  not  be  true.   Even  if  (3.195)  holds,  (3.304)  may  not 
hold. 

Since  C  is  orthogonal  we  have 


C'.Ci^I    (1  =  1,2,3) 


C'f.=0  (i^j). 


Consequently, 


Also, 


SSE  =  y'Ry  =  y'CiCiy  +  yC2C^y 


where 


(3.305) 


R  =  CiC;  +  C2C^.  (3.306) 


=  SSEi+SSE2,  (3.307) 


SSE.  =y'C.C'iy,      i  =  l,2.  (3.308) 
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Furthermore  y,  SSE^ ,  and  SSEj  are  mutually  independent  and 

SSEi/<t2  ~  xl 

(3.309) 

SSE2/a2~x^_2c- 

Now,  we  utilize  resampling  to  transform  the  vector  u  into  a  random 
vector  that  has  a  diagonal  variance-covariance  matrix.   Ve  do  this  by 
considering  the  cxl  random  vector  w  defined  as 


y  =  a+(AmaxIc-G)^y,  (3-310) 


where  Aj^^^^^  is  the  largest  eigenvalue  of  G.   Because  Aj^ax^c"^  ^^ 

1 
p.s.d.,  (An,axlc  ~^)'^  ^^  well-defined  with  eigenvalues  equal  to  the 

square  roots  of  the  eigenvalues  of  Aj^g^j^Ij.  —  G .   Ve  note  that  in  this 


case 


^max  =  emax(G) =  emax(QKQ') 


^maxC^)      (Q  ^^  orthogonal) 


=  -^       (n^^)  =minn^).  (3.311) 

n(0       ^      rex  ^^ 

Partition  w  just  like  u  in  (3.245).   That  is 

V  =  (y'f  :y'r)'  =  (yo:  '  •  •  ■■v'^-p'-v'^-p+V  '  '  '  =^1')'        (3-312) 
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where  each  y.  is  of  order  m^  x  1 ,  i  -  0  ,1 ,  ■  ■  ■  ,1/ .      The  next  theorem 
provides  some  distribution  properties  for  the  w- ,  i  =  0,1 ,  ■  •  •  ,1/. 

Theorem  3.10 


r  ^   pr   ^   /  ^i^^'      i=0,l,-.-,z/-p 

(a)   E(w.  )  =<  „^  - 

^  ^    ^-1^   I  Q,      i=i/-p+l,.  •  .,u. 


(b)  ^O'-i' ■  ■  ■ '-J^  ^^^  normally  distributed  random  vectors  with  u- 
having  variance-covariance  matrix 

Var(y.  )  =  (^.  +  1  c.|)I^  ,    i  =  0,1 ,  •  ■  •  ,z.. 
u*.  ;    1 

(c)  The  vector  w  is  independent  of  SSE2. 
Proof: 

The  proof  of  parts  (a)  and  (b)  follows  closely  to  that  given  in 
Theorem  3.8.   The  proof  of  part  (c)  closely  mimics  that  given  for 
Theorem  3.7(c) .  q 

Recall  from  (3.256)  that  Q •  Hg  =  Q .  r .  ,  i  =  0, 1 ,  •  •  •  ,  j/-p,  where  the 
vector  T^    is  defined  in  Lemma  3.10(c). 

The  vector  u.      Recalling  Lemma  3.9  and  that 


y  =  Xg  +  Zh  +  e  (see  (3.196)), 

y  =  Hg  +  Fh  +  e  (see  (3.206)), 
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and 


u  =  Qy  (see  (3.243)\ 

we  see  that  the  vector  w   fin  (3. 310) J  can  be  expressed  as 

y  =  Qy  +  (-(7yic-G)2c;(xg  +  zh  +  £) 

n^  ^ 

=  QHg  +  QFh  +  QI  +  (-3p-I^  -  G)5c;£  . 
n^  ' 

The  last  equality  follows  since  range(Ci)  C  range(R)  and  consequently 
CiX  =  CiZ  =  0.   Now,  let 

«-  =  Q«  =  (QrH)  =  (  0  )  (3-313) 

^-  =  «^  =  (q'F)  (3-314) 

£a.  =  Ql+(-77yIc-G)2C;£.  (3.315) 


n 


In  (3.313)  recall  from  Lemma  3.8(c)  that  Q.H  =  0  for  i  =  u-p+1 ,  ■  ■  ■  ,v . 
Also  q^  was  defined  in  (3.244)  while  Qj,  was  given  in  (3.220). 
Therefore  we  can  express  w  as 

y  =  Ht^g  +  F^h  +  e^.  (3.316) 

Clearly  E(fj^)=0.   Furthermore,  since  7  ==  De  (see  (3.200)),  it  must  be 
that  T  and  C^f  are  independent.   This  follows  since  DR  =  0  (cemma 
3.9(b)j  and  range(Ci) C range(R) .  Consequently, 
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Var(€^)=qVar(l)q'+(-i^I^-G)2c;Var(0Ci(-^Ie-G)^ 


-_   1   ^2t 


The  last  equality  follows  since  QKQ' = G .   Thus  model  (3.316)  describes 
a  balanced  mixed-model.   Letting  ^.  =5.  +_lo-2^  i  =  0,1 ,  •  •  •  ,:^,  we 

n  ^  ' 


obtain  from  Theorem  3.10  that 


n(M'  ifo^iS)-  (3.317) 


Let 


SS.  =w'.w.  ,   i  =0,1,  •  •  •  ,j/,  (3.318) 


then,  from  Theorem  3.10,  we  have: 


(i)  SS./^.  ~Xm.(Hi),   i=:0,l, •••,//  (3.319) 


where 


JjiiQ-Qiii,  i=  0,1,  ••-,,.-? 

i  =  v-p+l,  •  •  •  ,v. 


(3.320) 
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(ii)  For  MS.  =SS./m.  ,    i=0,l,---,i/ 


E(MSi) 


^i+iffrr'iQ'iQili,   i=0,l, •••,!.-? 


^i' 


i  =  iz-p+l , 


(3.321) 


Thus,  SSqjSSj,  •  •  •  ,SSj/,  and  SSEj  act  like  sums  of  squares  in  an  ANOVA 
table  for  a  balanced  mixed  model  (see  Khuri  (1982),  pp.  2915-291?). 
That  is,  analyses  concerning  the  variance  components  and  estimable 
linear  functions  of  the  fixed  effects  can  proceed  using  these  sums  of 
squares  just  like  in  a  balanced  data  situation.   In  fact,  if  the 
design  was  balanced,  then  similar  to  Section  3.3.5  we  can  show  that 
NSSq,NSSj,  •  •  • ,NSS^  reduce  to  the  usual  balanced  ANOVA  sums  of  squares. 
(Recall  that  N  is  the  common  last  stage  cell  frequency.) 

The  variance  components.   Using  SS    , -|^ ,  •  •  •  ,SSj^,  and  SSEj  we  can 
proceed  to  test  hypotheses  concerning  the  variance  components 
similarly  to  the  methodology  presented  in  Section  3.3.3.   Suppose  we 
wish  to  test  HQ:a-|=0   vs.  E^-.cr'^^^^O,    i  =  i/-p+l ,  •  •  •  ,i/,  then  the  power 
$•  ,  of  the  exact  test  (assuming  such  exists  based  on  SS^^^  ^-[^ ,  •  •  •  ,SS^, 
and  SSEj)  can  be  expressed  as  (take  SS^^^-j^  =  SSEj,  6^^-^^=0,    m^^j=— ^-y) 


fSS./m.  ,   ^ 

^  .  —  p  LI L  >  F         H 

*i-*^\^SSymj-'^«''"i'"'i(jl"ay 


i  =  t/-p+l . 


(3.322) 


SS- 
where  SS-/m-  is  such  that  E -^p- 


J'  J 


=  E 


SS 


when  HQ:cr|=0  is  true. 


J^a' 


SSj/mj 


J^,  Vk+-fr)^e 


kEWj 


SSj/mj 


•"i'™j' 


Under 
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Thus,    for    i  =  t/-p+l ,  •  •  •  ,J/, 


*i  -M.^mpmj^i+b^^^^a.n'pmjj 


(3.323) 


where 


1     _2" 


(3.324) 


(Compare  (3.223)  and  (3.224)  to  their  counterparts  expressed  in 
(3.299)  and  (3.300),  respectively.) 

Now,  let  us  suppose  that  the  design  is  indeed  balanced.  That 
is,  suppose  n^  =  N  for  all  tGT  (recall  (3.191)).  To  test  the  above 
null  hypothesis  we  would  form 


NSS-/m.  _SS-/mj 
^Bi  =  NSSj/mj  ^  SSj/nij' 


(3.325) 


SS. 


E 


SS. 


(i  =  f-p+l ,  ■  •  •  ,j/)  .   (Recall  that  SS-/m-  is  such  that  E -j^ 
HQ:<r?=0  holds.)  It  is  readily  shown  that  the  power  associated  wi 
this  test,  '^p-  ,  is 


when 
th 


*Bi  -T'"i,mj^i  +  b.eg.^a'">j,mjj 


(3.326) 


where 


%  = 


^    Vk  +  N^e 


(3.327) 
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(i  =  J/-P+1 ,  •  •  •  ,1/)  .   Thus,  the  power  of  the  above  exact  test  for 
Hq:<t^=0  in  the  unbalanced  case  fsee  (3.323)j  is  equivalent  to  the 
power  of  the  test  based  on  the  following  procedure: 

(i)   Randomly  delete  observations,  if  necessary,  until  all  r- 
cells  contain  n^  ^   observations. 

(ii)  Analyze  the  resulting  data  set  as  if  it  arose  from  a 
balanced  mixed-model  with  n^  ^    observations  per  cell. 

Ve  would  therefore  expect  the  methodology  developed  above  for  testing 
hypotheses  concerning  the  variance  components  (for  unbalanced  models) 
to  be  inefficient  when  most  of  the  cell  counts  are  much  larger  than 
n'^^.   Under  such  situations,  Aj,  ^^^^  will  typically  be  less  than 
l/n^^-^  and  thus  we  would  recommend  using  the  procedures  developed  in 
Section  3.3.3  (see  Lemma  3.11(a)j. 

Table  3.5  compares  the  powers  of  the  exact  tests  for  a^   for  four 
different  designs  arising  from  an  unbalanced  mixed  two-way  cross- 
classification  model  such  as  described  in  Gallo  and  Khuri  (1990). 
(Ve  assume  the  "a"  effect  to  be  fixed  and  the  "/?"  and  "a/?"  effects  to 
be  random  --  their  variances  being  ff a   and  <^^g-)      The  power 
expressions  used  in  the  comparisons  were  derived  in  (3.299)  and 
(3.323).   The  values  for  cr^  and  cr'^g   are  the  same  ones  used  in  Khuri 
and  Littell  (1987). 

Design  I  was  chosen  so  that  one  cell  would  have  a  small  cell 
frequency  relative  to  the  rest;  while  Design  II  was  chosen  so  that 
n^^-'^l.   In  both  cases  A^  max  <  ^Z"    ^°  that  the  testing  procedures 
developed  in  Section  3.3.3  are  to  be  recommended.   Design  IV  also  had 
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n'>^''  =  l.   However,  unlike  Design  II,  almost  25%  of  the  cells  had  only 
one  observation  in  them.   This  is  why  X^   max  =  ^  •   I"  this  case  we 
would  recommend  using  the  methodology  developed  in  this  subsection. 
Finally,  even  though  >^r ,ma.x< '^/^  i"  Design  III,  the  gain  in  power 

is  not  worth  the  extra  effort  necessary  to  develop  the  test 
statistics  in  Section  3.3.3.   Thus  in  "nearly  balanced"  designs  we 
would  again  recommend  using  the  procedures  developed  in  this  section. 

We  should  point  out  that  the  unbalanced  mixed  two-way  cross- 
classification  model  is  a  very  special  case  of  the  general  unbalanced 
mixed-model  (with  imbalance  affecting  the  last  stage  only).   Thus,  we 
may  need  to  modify  our  recommendations  when  more  general  models  are 
considered.   Ve  make  two  final  remarks: 

(i)   In  Section  3.3.4  we  described  a  procedure  for  constructing 
exact  tests  (when  such  exist)  for  testing  a  testable 
hypotheses  concerning  estimable  linear  functions  of  the 
fixed  effects.   The  power  of  these  tests  was  derived  in 
(3.301).   Recall  that  the  power  is  a  monotonically 
decreasing  function  of  ^^^  (see  Section  3.3.6).   Exact  tests 
for  the  fixed  effects  can  also  be  constructed  by  utilizing 
SSq,---,SSi,  ^see  (3.318)J.   As  in  Section  3.3.6,  the  power 
of  these  tests  will  be  a  monotonically  decreasing  function 
of  ^.  ^5i  +  l/n(0,2.   si„,^  ^fi^^i+A,^^^^.|  and  A,^^^^ 
<1/q^   '^  by  Lemma  3.11(b),  we  can  conclude,  similar  to  the 
recommendations  made  above,  that  when  most  of  the  cell 
counts  are  large  relative  to  n^^^  the  procedures  developed 
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Table  3.5    A  comparison  of  the  powers  of  the  exact  tests  for  a o 

in  an  unbalanced  mixed  two-way  cross-classification 
model 


Cell  counts: 

row=a  effect  .  , 

col=/3  effect  a^  <t^^  *r2  (^ee  (3.299)J    *2  (see  (3.323)J 

.2  .2  .138  .121 

2  10  9     .2  5.0  .059  .059 

I  10  10  9    1.0  1.0  .229  .215 

9   9  10    5.0  .2  .787  .750 

5.0  5.0  .273  .268 

^r, max  =-370;  l/n^^^  =  .5 


.2  .2  .102  .091 

14  4     .2  5.0  .058  .058 

II  3   3  4    1.0  1.0  .193  .175 

4   4  3    5.0  .2  .687  .633 

5.0  5.0  .260  .252 


'r ,  max 


.750;   l/n(^)  =  1.0 


5  5  5  5  6     .2   .2           .473  .470 

6  5  6  5  5     .2  5.0           .082  .082 
III  6  4  5  6  5    1.0  1.0           .668  .667 

6  6  5  5  6    5.0   .2           .994  .994 

6  5  5  5  6    5.0  5.0           .721  .721 

^r, max  =-2457;  l/n^^^  =  .25 
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Table  3.5  —  continued 


Cell  counts: 
row=a  effect 

col=/?  effect  <T^  al^       ^^2  (see  (3.299))    *2  (see  (3.323)) 

9  2  9   12  .2     .2 

10   1  2  9  10  .2  5.0   In  this  case  the  powers  are  identical 

IV   1   8  1   2  2  1.0  1.0   since  Ar^„a^  =  l/n(^)=l. 

9  10  1   9  3  5.0     .2 

8  3  2  10  1  5.0  5.0 


in  Section  3.3.4  should  be  used.   Otherwise  construct  exact 
tests  by  utilizing  the  sums  of  squares  developed  in  this 
section. 

(ii)  Our  methodology  has  an  advantage  over  the  methodology  based 
on  deleting  observations  until  all  r-cells  contain  n^^-^ 
observations  (as  described  earlier)  in  that  it  uses  all  the 
data  in  the  calculation  of  the  test  statistics.   This  is 
clearly  a  desirable  feature  especially  if  n^^-^=l.   For 
example,  if  n'-^'' =  1  then  the  "deleting  observations 
methodology"  could  not  produce  a  test  for  cr^   ^^  ^  two-way 
cross-classification  model,  whereas  our  procedure  would 
allow  us  to  construct  such  a  test. 

Ve  now  turn  our  attention  to  the  development  of  exact 
simultaneous  confidence  intervals  on  estimable  linear  functions  of 
the  fixed  effects.   Since  model  (3.316)  describes  a  balanced  mixed- 
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model,  we  will  use  the  results  of  Khuri  (1984)  to  obtain  our 
intervals. 

The  fixed  effects  parameters;   Introduction.   In  Section  3.3.4 
we  developed  exact  procedures  concerning  estimable  linear  functions 
of  the  fixed-effects  parameters.   The  development  involved  first 
constructing  the  vector  v  ^see  (3.257)V  which  consisted  of  those  u-, 
i  =0,1,  •  •  •  ,i/-p   (and  the  corresponding  u^  ,  1^  =  u-p+1 ,  ■  ■  •  ,i/)    for  which 
^i='5-  ,  then  making  a  transformation  to  arrive  at  un  (see   (3.265)). 
The  (5j^,  k  =  0,l,---,j/  were  defined  in  (3.217).   The  w^^ ,  i  e  3"  (see 
(3.259)  and  (3.268)j  were  then  utilized  in  the  construction  of  the 
exact  procedures.   The  motivation  behind  the  construction  of  v  came 
from  the  procedures  used  in  dealing  with  estimable  linear  functions 
of  the  fixed-effects  parameters  in  a  balanced  data  situation.   For 
balanced  data,  exact  methods  can  be  derived  if  a  proper  mean  square 
can  be  found  in  the  corresponding  ANOVA  table  that  can  serve  as  an 
"error"  term  in  the  F-ratio  used  for  testing  a  hypothesis  concerning 
the  parameters  associated  with  the  fixed-effects. 

Since  SSq, • • • ,SS^  and  SSEj  fsee  (3.318)j  mimic  the  sums  of  squ 
associated  with  a  balanced  mixed  model,  we  can  also  develop  exact 
procedures  concerning  estimable  linear  functions  of  the  fixed-effects 
parameters  (based  on  the  methodology  developed  in  this  section)  if  we 
can  find  a  proper  mean  square  that  can  serve  as  an  "error"  term  in 
the  F-ratio  similarly  to  the  balanced  data  situation.   If  no  mean 
square  exists  that  can  serve  as  an  "error"  terra  in  the  F-ratio,  then 
we  can  apply  Satterthwaite' s  procedure  in  the  usual  way  to  develop 
approximate  methods.   However,  as  pointed  out  in  a  recent  article  by 
Ames  and  Webster  (1991),  in  many  instances  the  Satterthwaite 


ares 
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estimator  of  the  denominator  ("error")  degrees  of  freedom  will  be 
positively  biased.   This  will  cause  the  resulting  inference  to 
overstate  the  actual  Type  I  error  rate.   This  same  criticism  also 
applies  to  the  approximate  method  developed  in  Section  3.3.4.   (in 
certain  layouts  Ames  and  Webster  (1991)  develop  an  improved  estimator 
of  the  true  denominator  degrees  of  freedom.   Details  can  be  obtained 
from  this  article.) 

For  balanced  mixed  models,  Khuri  (1984)  developed  a  method  for 
the  construction  of  exact  simultaneous  confidence  intervals  on 
estimable  linear  functions  of  the  model's  fixed-effects  parameters  in 
situations  when  no  proper  mean  square  existed  which  could  serve  as  an 
"error"  term.   We  now  extend  his  results  to  the  unbalanced  mixed 
model  where  the  imbalance  affects  the  last  stage  only.   The  idea  is 
to  find  a  sequence  of  independent  and  identically  distributed  random 
vectors  each  having  a  normal  distribution.   Furthermore  their  common 
mean  vector  will  consist  of  a  vector  of  estimable  linear  functions  of 
the  fixed-effects  parameters.   Exact  simultaneous  confidence  intervals 
on  these  estimable  linear  functions  can  then  be  obtained  by  utilizing 
this  sequence  of  estimators. 

The  fixed  effects  parameters;   Notation  and  preliminaries. 
Recall  that  r  =  (kj  ,k2,  •  •  •  ,kg_-^)(see  (3.191)).   By  rearranging,  if 
necessary,  the  order  of  the  elements  in  t,  we  can  consider  the  first  Sj 
subscripts  (l<Si<s-l)  to  be  associated  with  the  fixed-effects  in 
(3.316);  the  next  s^   subscripts  (l<S2<s-l)  are  nonnested  random 
subscripts,  that  is,  subscripts  associated  with  those  random-effects 
in  (3.316)  which  do  not  nest  each  other  and  are  not  nested  in  any 
other  random  effects;  and  the  last  S3  subscripts  are  associated  with 
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random  effects   nested   in  at   least  one  of   the  preceding   Sj   effects.      We 


note   that 


Define 


Sj  +  S2  +  S3  =  s-1.  (3.328) 


s-1 


di  =  .    n      a.  (3.329) 


l=Sj+l 


s-1 
f.        n         a- ,      if   Si  +  S2  +  l>s-l 

l=Sj+S2+l     ^  ^ 

^2=  1,  else  (3-330) 


S1+S2 


^3=    .    n      a.  =di/d2.  (3.331) 

l=Sj+l     ^ 


s-1 


Recall    that   c=     .Ha-    (see    (3.192)).       Also    let   A   be    the    set   of   vectors 
of  dimension   Sj   of   the   form 

A-{?^:t^=(is^  +  l,i3^^2'-  •  •'isi+s2)'}  (3.332) 


with 


ig —  1 ,2,  •  •  • ,a£,         £  =  Sj+1 , •  •  • ,Sj+S2. 

Ve  note  that  the  number  of  vectors  in  A  is  dg.   Now,  from  (3.203)  and 
(3.204)  recall  that  the  cxc^  matrix  H^  was  defined  as 


«i=  J®jL.g,   i=0,l,...,^ 


where  c^  was  given  in  (3.194)  and 
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i«-\l    for  k^G^f 


(i  =  0,l,  •  • -,!/;   £=1,2,  •  •  •  ,s-l).   Hence,  for  i  =0,1,  •••  ,1/, 


Hi=(^®h£)  ^(J^f^iWg)-  (3.333) 


Thus,  for  i  =0,1,  •  •  •  ,i/-p  (that  is,  H^  is  a  matrix  associated  with 
fixed-effect)  we  have 


=  (^®L.g)  ®lj^.  (3.334) 


This  follows  since  the  subscripts  associated  with  the  random  effects 
in(3.316)  cannot  belong  to  rjj^,  the  set  of  subscripts  associated  with 
the  fixed  i*  effect  in  the  model,  i  =  0, 1 ,  •  •  •  ,i/-p.   Note  that  (3.334) 


can  a 


Iso  be  expressed  as  fsee  (3.331)] 


"i=(gfl^i«)  ®ld3«'id2'  i=0,l,...,i.-p.    (3.335) 


Furthermore,  from  (3.313),  let  the  cxc-  matrix  H  .  be 

1  UJl 


"wi  =^^i'  i  =0,1,- •  •,z/-p,  (3.336) 


then 
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H    .  =Q 


(|lW«) 


ld3®id. 


,    i  =0,1,  •  •  •  ,i/-p,  (3.337) 


For   i  =  0 , 1 , • 


fV-p  and   If  e  A  define   the   cxc|    matrix  H^  as 


.        /I         \      /  ^^      1     i\ 
Hf  =      0  L.J®        ®      la!)®lj   , 
1       W=l    ^^^     ^^£=81+1    V      ~*^2' 


(3.338) 


where  1^^  is  a  vector  of  dimension  a^  (£ =  s^+l , • • • , Sj+Sj)  whose  i^^^ 
entry  is  unity  and  the  remaining  entries  zero  (i«  =  1 ,2,  •  •  • ,a»)  . 
Similar  to  (3.337),  define  cxc-  the  matrix  H^.  as 


H^.=QHf,   i=0,l,-..,i/-p;  ^eA. 


(3.339) 


Finally,    recall    that  A^^  =  H^H'.  ,      i  =  0,  •  •  •  ,i/  (see    (3.207)).      Thus,    using 
(3.208),    we  have   for   i  =  0, 1 ,  •  •  •  ,r/-p. 


A-  =(    ®  M..)®f    %      J     ) 
=  (J/i£)®Jd3®Jd2- 


(3.340) 


Furthermore,  define  the  cxc  matrix  A  ■ 


UJl 


as 


A.,i=Ha;iHLi=«HiHiQ'  =  QAiQ' 


(3.341) 


(i  =0,1,  •  •  •  ,j/-p).   Also,  for  i=0,l 


,.,-■•  ,i/-p;  VGA  let  A^  =  H^Hf  and 


A^i=HEiH^i-   Then 


(3.342) 
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and 

A^i=:QH^Hfq'  =  qA^Q'  (3.343) 

where  J^^   is  an  a^xa^  matrix  whose  (ig,ig)  entry  is  unity  and  the 
remaining  entries  are  zero,  ig = 1 ,2, •  •  ■ ,ag;  «  =  s^+l , •  •  ■  ,Si+S2.   Both  Aj 
^^'^  ^ui    ^^^   °^   order  cxc,  i=0,l,---,j/-p;  j/GA. 

The  fixed-effects  parameters:  Initial  development.   From  Khuri's 
(1990)  Lemma  3.2  we  have  for  i  =  0,1,  •  ■  •  ,i/-p 


The  last  equality  follows  since  A^  •  =  0  for  i  =  0, 1 ,  •  •  •  , j/-p  and     j  =  t'- 
p+l,---,i/  ^see  Khuri  (1982)j.   The  cxc  matrix  P.  is  associated  with 
the  sum  of  squares  for  the  i*'*  effect  in  (3.213),  i  =  0 , 1 ,  •  •  •  ,z/-p. 


Define  the  cxc  matrix  P  •  as 


P.i=g^A^j  =  <g^A.)Q',  (3.345) 


(i -0,1,  •  •  •  ,i/-p) .   From  (3.344),  we  can  equivalently  express  (3.345) 
as 


P^i  =  qPiQ',  i=0,l,...,z.-p.  (3.346) 


Furthermore,  define  the  cxc  matrices  P^  and  P- .  as 

1  (jJ\ 


PI  =  E(-^)a^,  i=0,l,- •  .,i/-p;  VGA  (3.347) 

J=0^  j^  J 
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pL='!?(-F^Hj  =  '^Pi'^''  i=0'l'---.'^-P;  Y^A.         (3.348) 


j=0^"j 


The  last  equality  in  (3.348)  follows  from  (3.343). 

Now,  consider  the  Product  P^^^H  .,  for  i  ,  i' =  0,1 ,  •  •  •  ji^-p.   From 
(3.337),  (3.340),  and  (3.345)  we  obtain 

Pu.iH.i'  =  «Pi«'«"i'  =  «Pi"i' 


=.{(Y(^h)((^v>^; 


j=0^"j 


=  Q^ 


I  i/-pA.  .  sj    . 

^  1=0 '^1  «=1  ^^'         1 


j=0"j 


(|lh'>id, 


=  d,Q|(g^|^Mj,L.,>l,j. 


(3.349) 


Similarly,  from  (3.337),  (3.342),  and  (3.347)  we  have 


PEiHu.i'  =  '^PlQ'«"i'  =  «Pl«i' 


J/-P  .A. 


H  S(^)*U(i^i'>W«^ 


^-P  A. j  si 


Si+S 


q|(  .e  ^ ^f/j£Li,X  ^iJ4^ia\^         ^'-'"'^ 
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(i,i'  =  0,---,i^-p;  J^6A).   Define  the  matrix  N.  ./  as 


N-./^'lf  tM  ®  M.,L.,.,  i,i'  =  0,l, •••,!/-?.  (3.351) 

11   j=0   j  ^=1  J  1  '^ 


Then,  from  (3.349)  we  see  that  for  i  =  0, 1 ,  •  •  •  , J/-p, 


Pa;i«u;  =  Pu;i('3"0  =  QHl=  •••=«"»/-?)      (^^^  (3.313)) 
=  Pwi("wO  =  «a.i=---=Ha.,i.-p)    (see  (3.336)) 


=  (diQ[Nio®ld^]:---:diQCNi,^_p®ld^]) 


=  diq(Ni®lj^),  (3.352) 


where 


Ni  =  (Nio:N.j:---:N.^^_p),  i  =  0,1 ,  ■  ■  ■  ,u-p.  (3.353) 

Similarly,  from  (3.350)  we  have  for  i  =  0,1 ,  •  •  •  ,t^-p;  i^GA, 

P^iHo.  =  d^Q^Ni  ®(2^''^li>  Id,)-  (3.354) 

Thus,  for  i  =  0,1 ,  •  •  •  ,i/-p;  fGA, 

rank(P^.H^)  =rank(N.)  =rank(P^iH^) 

=  rank(QP^q'QH)    (see  (3.346)  and  (3.313)) 
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=  rank(P^H) 

=  rank(Q'.Q.H)      (recall  (3.254)) 

=  rank(Q-H)         (Q'-  has  full  column  rank) 

=ra^      (by  Lemma  3.10(a)V  (3.355) 

Now,  there  exists  a  matrix  C^  of  order  m-xclg  (dg  =  n  a^)  and  rank  m^ 
such  that  C*N-  consists  of  m-  linearly  independent  rows  of  N-, 
i  =0,1,  •  •  •  jt'-p.   Define  the  m-  x  c  matrix  T»  as 


T-.=Ki^.P«^.  (3.356) 


where 


k!^,.  =|C1®('®''4)011)q',  (3.357) 


wi 


(i  =0,1,  •  •  •  ,i/-p;  i/€  A). 

The  fixed-effects  parameters:   The  sequence  of  estimators.   For 
each  i  =  0, 1 ,  •  •  •  jt'-p;  j^  €  A  define  the  c  x  1  random  vectors  ,  zr  ,  as 

zf  =  T^.y.  (3.358) 


We  now  establish  some  distributional  properties  for  the  zt .   Before  we 
do  so,  however,  we  will  need  the  following  lemma. 
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Lemma  3.12 


Recall    the   definitions   of   P.    in    (3.344)    and   P»    in    (3.347), 
i  =  0,1 ,  •  •  •  ,i/-p;    j/GA.      Then 


(a) 


yl^ul^    _J"3 


^P\,      i  =  i'  =  0,l,-  •  ■,u-p;    ueA 


^Vi'-\     0,         i#i';    ue 


(b) 


P.P?, 


^  _/i^[%®Jd,]Pl'      i  =  i'  =  0,l, ••-,.-?;    i^GA 
0,  i  7^  i';  J^e  A. 


Proof: 


(a)   From  the  fact  that  P-  is  idempotent,  i=l,  •••,!/  (see  Khuri's 
(1990)  Lemma  3.l),  we  have 


i/-p   A-  •   -2  ^-P  A 


A;:;A,  ,, 


'^^^h^'^^ht^'^'sh^'^'^ 


=  d,   E 


''-P^ii  fsi 


^o'^w^'^'^'^S 


(see  (3.340)) 


=  d, 


i/-pA?j  si 


^  j=0  ^]   ^=1  J     -^  ^   ^1 
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+ 


(      E       u   h         ®M.„M.,.®J, 


f,  J/-pA..Si         \2 


E-Fr^®M.j®j,  =p.,  1  =  0,1, ■■■,u-p. 


Consequently,    we  must  have 


,J/-pA..Si        .2     J^-pA.  -sj 


(3.359) 


Thus,    for    i  =  0, 1 ,  •  •  •  ,i/-p;    J^GA,    we   obtain 


J=0     J 


(see   (3.347)) 


1/-P  A.  . 


Si+So 


1  /  1'2       l/j\ 

£=1    J^     ^    £=si+l   ^e^       ^: 


(see    (3.342)) 


u-pX 


Si+Si 


t2 


"^"^iJi^     f«,^"'l 


E  -]-^®  M.J  (g)        ®      Jaf  ®Jd 
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^   i^-pA^  j  Sj        X      ,  s,+s 


(i   .5,-^i"j^)®(,.l^,l-^a>  Jd,  (f--   (3-359)) 


(see    (3.331)). 


Since  P^P.,  =  0   for    i  ^  i'  (see  Khuri's    (1990)    Lemma  3.1))  we   have, 
for    i  i^  i', 


1/-P   A. 


J/-P  A-  -A.,.  A.  -A.,., 


A -A.,. 


j.O        b^j  J   J     j  ^J'   bjby        J    J 


After  some  simplification  similar  to  the  above  we  obtain 


0  =  d 


v^   A.  jA.,.  s^ 
j=0 


(•?n^^iVj>^ 


+ 


/      ^i  i'^i'i'  ^1       \ 


=  d, 


A,/,/ 


I/-P  A.  .  Sj    ^  I/-P  '^i/-/     . 

^jtb  i^j  «=i  J^^jto  V  J^^ 


j=0  j'=0  ^j^j'  «=i  •^'^  J  ^ 


(3.360) 


Thus,  for  i  ^  i';  i^€A,  we  have 
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l/-p  I/-pA-   ;A.,  ., 


kW~., 


j=Oj'=0  '^jV       '  J 


1/-P   l^-P'^ij-^i/j/fSi  ,    Sj  +  S 


=  d,.E     E^--5^®Mj,M.,,®(^^®^^Ja^0Jd,}     (-e    (3-342)) 


=  d. 


^_p  i/_pA-  -A., .,   Si  \     /  Si+s,    !„>, 


=  0  (by   (3.360)). 


(b)      For   all    i  =  i'  =  0,l,---,t/-p  and   i^  6  A, 


.  v-p   A-  •        , ,  I/-P    A.  ., 


i/-p  A^  •  A-  -A.  ., 

j=o  bj     J  J  j^y    '^jbj'    J  J 


From   (3.340)   and    (3.342)   we  can   express   this  as 


j=0    b. 


Si+Sn 


°1  /    °1^'='2    ,1«\ 


+  d2      E 


^iAj' 


'2     .^.,       b.b., 


Si+So 


®^^:,i^yt®^6i^j^l/Jf)®^^ 


=  d 


{ 


f-p   A|  .    sj  '^ii^ii'Si 

j=0    b^j    g=l    J«   J^     j  ^  j/  bjb  ,  g^i    je   J  i 
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St  +  S 


(12    1   \ 


=  d. 


S,+So 


■j/-pA.  .  sj 


(12   i  \      I 


"0    "3    °2 1 


[%^^fl- 


When  i  7^  i'  and  i;  G  A  we  have,  reasoning  similarly  to  the  above  and 
using  (3.360),  that 

PiPf.  =  0.  [ 

Now,  recalling  (3.358)  and  using  Lemma  3.12,  we  have  for 
i  =0,1,  •  •  •  ,i/-p;  ^;  G  A, 

(i)  E(zf)=T^.E(y)=T^.H^g  (see    (3.317)) 

=  K^iP^iHc.g  (see    (3.356)) 


K: 


Si+S 


Yi®(J3®/lia^)®ldJg  (see    (3.354)) 
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=  d2[C^N.  ®(l)®d2]  g       (see  (3.357)) 


=  d^qNig. 


(3.361) 


(ii)   Var(zf)=T^i[Var(y)]r^'i 


^  i=i/-p-l-l   ^     n*^  '' 


Jc 


The  term  in  the  brackets  follows  from  (3.316)  and  the  facts  that 
Var(€^)=-^2j^  (see  (3.315))  and  b  =  (^;,_p^i :  ■  •  •  r^^,)'  with 
Var(/?j)  =a?Ic.  ,  i  =  i/-p+l ,  •  •  •  ,  J^  (see  (3.196)J.   Now  from  (3.314) 
and  (3.348)  we  have,  since  Pj  is  symmetric  (see  (3.347)  and 
(3.342)), 


Var(,^)  =  ^l^MiJ^^/^^'^  +  ^nn 


Now,  the  first  term  in  the  bracket  can  be  written  as 


P^f(      e       <^l)F'Pf  =  Pf(        t      ^ViH  («^^   (3.210)) 


j=i/-p+l    -J      -^ 


(see  Khuri's    (1984)    Lemma  5.2) 


where 


^     JO,  ^iOj 

^ij      \byd3,      V'iCt/'., 


(3.362) 


(i  =  0,1 ,  •  •  •  ,i/-p;  j  =  i^-p+1 ,  •  •  •  ji';    u  e  A)  .      Also 
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b*. 


j  =  ^-p+l, 


(3.363) 


where 


^J«-\  1,    kgGVj, 


(3.364) 


(see  Khuri  (1984)V   Therefore,  recalling  (3.217), 


Hence,  from  Lemma  3.12(a)  and  (3.357), 


Var(zf)=K^iQ 


'U.^^.^y^yi-i^hf'^i 


d'Kw 


s,+St     ;' 


m<  .j;/^>''^^)hh,'y^-^>i 


Sn+S 


C-'®         ®      la!  ®1h 


(3.365) 


Now,    let 


u-p  A.  .    sj 
B:  =  E    TT^    ®  Mig,    i=0,l, 

j=0    ''j  «=i  J'^ 


,i/-p 


(3.366) 


^1=^      E    b*.aU,2//i)^        i^o,l^ 


•  ■  ^i^-p, 


(3.367) 
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see 


(121fl\  / 

®      ^a.J'^^A   »    i  =0,1,  ■  •  •  ,J/-p 
£=Si+l      ^^  2  ^ 

(3.347)   and    (3.342)],    we  have   that    (3.365)    can   be   expressed  as 

ri[c^B.Ct'®(l)®(d2)]  =  ^td2ctB.Cl;',  (3.368) 

(i  =0,1,  •  •  •  ,i/-p;    i^G  A). 


(iii)   For  i  ^  i',  tfGA,  and  recalling  P~^  is  symmetric 


Cov(zf,z|,)=T^i[Var(y)]T^'.,      (see  (3.358)) 


•^wi^wi 


®  ^^Im  V-./K-'.,    fsee  (3.356)  and  (3.317)) 
i=0    i  ^^    ^■^  V    ^     ^  / 


(see   (3.348)  and  (3.214)) 


Qo 


■^  1    wi 


(see  (3.252)) 


wi^  1 


d 


t^^^iiv^yi' 


Q'K^'. ,  =  0 


(3.369) 


This  follows  from  Lemma  3.12(b)  and  the  fact  that  P^  is 
idempotent  for  i  =  0, 1 ,  •  •  •  ,z/-p.   We  have  therefore  just  proven  the 
following  theorem. 
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Theorera  3.11 

The  random  vectors  zq  ,    Zf ,  •  •  •  ,zp_p  are  independent  normally 
distributed  vectors  with 

(a)  E(z^)=d2c^N.g,  i=0,l,---,i^-p;    u^K 

(b)  Var(zf)=^td2ctB.q',  i  =  0,1 ,  •  •  •  ,;.-p;  u€K 

where  dj  was  defined  in  (3.330);  g  is  the  vector  of  fixed  effects 
defined  in  (3.196);  C\    is  such  that  C^N-  consist  of  m-  linearly 
independent  rows  of  N^  m^  was  given  in  (3.353));  B-  was  gi 
(3.366);  and  (,\   was  defined  in  (3.367). 


This  theorem  shows  that  the  random  vectors  in  the  set  {z^:i/GA} 
share  the  same  mean  vector,  djC^N-g,  and  the  same  variance-covariance 
matrix,  C^'^l^*^'^ f*^ ,    j  =  0, 1 ,  •  ■  •  ,i.-p.   The  matrix  C^BjC^  (  j  =  0, 1 ,  •  •  •  ,z/- 
p)  is  nonsingular  since  if  it  were  singular,  the  rows  of  T-  •  fsee 
(3.356)  and  (3.358)j  would  be  linearly  dependent.   But  this  impli< 
that  the  rows  of  T^ -H^  would  be  linearly  dependent  which  is  a 


:iven  in 


les 


contradiction  since  T^ jH^^  =  d^C^N j  by  (3.361)  and  C^  was  determined  so 
that  C-N-  would  have  linearly  independent  rows. 

Further,  CjN-g  is  estimable  for  all  j  =  0, 1 ,  •  ■  •  ,i/-p  since 


diC^Njg  =  T^jH^g  (by  (3.361)) 

=  T^jQHf  .         (by  (3.313)) 


-182- 
=  TEjQE(y)  (by  (3.210)). 

Also,  for  each  j  =  0, 1 ,  •  •  •  ,i^-p  and  i/€A,  the  rows  of  P-  -H, ,  and  those  of 
^wi^w  span  the  same  vector  space  spanned  by  the  rows  of  N-  and  hence 
by  C^Nj  (see  (3.354)  and  (3.355)).   Since  P^jH^  =  QPjQ'QH  =  QPjH  (see 
(3.346)  and  (3.313)),  the  space  spanned  by  {qPflHg, •  •  •  ,QP^_pHg}  which  is 
the  same  as  the  space  spanned  by  {QqQq^' '  '  "  »Ql/-pQj/-pHg}  is  the  space 
of  estimable  linear  functions  of  the  fixed-effects  parameters  in  model 
(3.316)  (recall  Lemma  3.10(c)  and  (3.352)).   Thus  {C^N^g,  •  •  •  ,C*_pNj,_pg} 
must  also  span  the  space  of  estimable  linear  functions  of  the  fixed 
effects  parameters.   Consequently,  we  can  restrict  our  attention  to 
the  development  of  exact  procedures  concerning  CgNgg, • • • ,C^_pN^_pg. 
The  fixed-effects  parameters:   Exact  procedures.   Consider  the 
set 

Ao  =  {if:?^  =  qls2'  l<q<^o}  (3.370) 

where  q  is  constant  and 


^0=   1  3^5-     ^i-  (3.371) 

"   Sj  +  l  <  1  <  Sj  +  Sj  ^ 


Also,  define  the  subset  of  vectors  J"  •  to  be 


^j  =  {^y.iieAQ},    j  =  0,l,  •  •  •  ,j/-p.  (3.372) 

Then  as  explained  in  Khuri  (1984,  p. 21)  the  sequence  of  Tq  random 
vectors  in  J'j  ( j  =  0,1 ,  •  •  •  ,j/-p)  are  statistically  independent.   Since 
these  random  vectors  are  normally  distributed  with  the  same  mean 
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vector  and  variance-covariance  matrix  (see  Theorem  3.11),  it  follows 
t  an  unbiased  estimator  of  ^^  ^see  (3.367)  and  Theorem  3.11(b))  is 


tha 
given  by 


tr[Sj(d^C^BjCp-Vmj(ro-l)]  (3.373) 


where  S-  is  the  matrix 


'J=.?.  (?3-?j)(?3-Wj)'  (3-374) 


uEAq 


and 


?i=T^   E  ?1,  (3.375) 


•J   "0  ^GA 


j  =  0,1,  •  •  •  ,j/-p  ^see  Anderson  (1984,  p.  430)].   Further, 

tr[Sj(d^C*.B.Cp-]/rj~4.(,^_l),  (3.376) 

(  j  =  0,1,  •  •  •  ,i/-p) .   Since 


( j  =0,1,  •  ■  •  ,j/-p),  it  follows  that 


ro(^j-<ilC]l^jgndlC^B.C*.r\^.-d^N.g)/e^^xl.,  (3.377) 


( j -0,1,  •  •  •  ,j/-p).   Finally,  because  z-  and  S-  are  independent  (s 
Anderson  (1984,  p.  71))  we  conclude  that 
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(.,-dic>,g)'(dic]B.cy)-(zj-djqNjg) 

'O('O-I) tr[Sj(d^CpjCy)-]  fmj.mjCrj-l)'      (3.378) 


( j  =  0,1,  •  •  •  ,J/-p)  .   This  formula  allows  us  to  test  a  testable 
hypothesis  concerning  C*-N  •§(  j  =  0,1 ,  •  •  ■  ,J^-p)  using  an  exact  F-test. 

Simultaneous  confidence  intervals  on  linear  functions  of  C-N-g 
( j  =  0,1,  •  •  •  ,J/-p)  can  be  obtained  from  the  formula 


p{d2fC^Njg€|'lj±Co[|'(d2c^BjCp|p}  =  l-a,        (3.379) 


( j  =  0,1 ,  •  •  •  ,i'-p)  ,  where  |  is  a  vector  of  constant  coefficients  and 


:0={tr[Sj(d^C*.BjCp-]F^^^.,,.(,^_l)/ro(ro-l)}' 


As  a  last  remark,  the  matrix  C*-  ( j  =  0, 1 ,  •  ■  •  , J/-p)  is  not  unique. 
However,  as  pointed  out  in  Khuri  (1984,  p.  22),  the  F-statistic  given 
in  (3.378)  and  hence  the  confidence  interval  given  in  (3.379)  is 
invariant  to  the  choice  of  the  matrix  C-. 

3.3.8.   Numerical  Examples 

Dyeing  of  cotton-synthetic  cloth.   To  begin  with  we  use  the  data 
in  Table  3.6  to  illustrate  the  methodology  developed  in  Sections  3.3 
and  3.4  of  this  chapter.   The  data  in  this  table  represent  numerical 
scores  resulting  from  a  comparison  between  a  finished  version  of  a 
dyed  cotton-synthetic  cloth  and  a  standard.   Two  temperatures  (Factor 
A),  three  cycle  times  (Factor  B) ,  and  three  operators  (Factor  C)  were 
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selected  and  at  most  three  small  specimens  of  cloth  were  dyed  under 
each  set  of  conditions  fsee  Montgomery  (1984,  p.  245)j.   Ve  assume  that 
Factors  A  and  B  are  fixed  while  Factor  C  is  random.   The  appropriate 
model  is  given  by 

yijk  =  /^  +  «i  +  /?j  +  («/?)ij  +  7k  +  («7)ik  +  (/5T)jk  +  («/?T)ijk  +^ijkl' 

i  =  l,2;  j=l,2,3;  k=l,2,3;  £=1 ,2,  •  •  •  ,n.  jj^. 

Our  first  task  is  to  analyze  the  variance  components.   To  aid  in 

the  understanding  of  the  application  of  the  testing  procedures,  the 

reader  is  referred  to  Table  3.7  which  lists  some  key  quantities  used 

in  the  development  of  the  exact  tests.   The  expected  mean  square 

values  of  MS  •  =  SS  -/m-,  1=4,5,6,7,  are  provided  in  Table  3.8.   The 
n     ri'  1 

results  are  summarized  in  Table  3.9. 

Ve  can  see  from  Table  3.8  that  no  exact  test  (based  on  our 
procedure)  exists  for  testing  Hq:(t^  =  0.   Thus  we  must  apply 
Satterthwaite's  procedure  and  construct  an  approximate  test.   The  test 
statistic  in  this  case  is  given  by 


4-MSr5  +  MSr6-MSr7 


which  has  an  approximate  F-distribution  with  2  and  t)     degrees  of 

freedom,  where 

(MS^5  +  MS^6-MS^,,)^    _^  728 


The  observed  level  of  significance  is  0.3356. 
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Table  3.6.    Numerical  scores  resulting  from  a  comparison  between 
dyed  cotton-synthetic  cloth  and  a  standard 


50 


60 


Temperature 


300°  350° 


Operator Operator 


Cycle 
Time     1 


36 

38 

35 

39 

36 

37 

34 

35 

38 

36 

28 

35 

26 

24 

35 

27 

34 

25 

23       28       31  24       36       36 

40      25       26       32  35       39 

29 


33  37       34       34 


26 

36 

28 

29 

37 

26 

34 

24 

(source:   Montgomery  (1984,  p.  245)j 


Ve  now  turn  our  attention  to  the  analysis  of  the  fixed-effects, 
The  reader  is  referred  to  Table  3.10  which  lists  some  quantities 
needed  in  the  construction  of  the  exact  tests  (see  also  Table  3.7). 
From  Lemma  3.10(c)  we  may  restrict  our  focus  to  testing  hypothesis 
concerning  lo'Ii'Is'  ^°^  Is*  ^'^   this  case  these  quantities  become: 
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(ai-a+(a_^)l.-(a_^))l9 
(a2-a+(a/?)2.-(a/?))l9 


l3  =  2 


(/?3-^+(cr^).3-(a"/?))l6 


(  (a/3)ii  -  (a"^)i .  -  (a/3)2i  +  (a~^)2.  )l3 

((a/3)i2-(cr^)i.-(a/?)22+(«"^)2.)l3 

((a/3)i3-(cr^)i.-(a/?)23+(a"^)2.)l3 

(-(a/?)ii  +  (cr^)l.+(a/?)2i-(a~^)2.)l3 

(  -  (a/3)i2  +  (cr^)i .  +  (a/?)22  -  (a"^)2  .  )l3 

(  -  («^)l3+  ("~^)l.  +  («/?)23-  («~^)2.  )l3 


where 


Oj  +02 


/?  = 


/?1  +  /?2  +  ^3 


2     3 


i=i.i=l  -^ 


(a^)i.= 


^11^     ^12^     ^13  ;   _  1     o 

Q  5  1    —   1  ,  ^ 


ay3    .+a/?    . 
(a/?).j=-^^-2 ^.      j  =  l,2,3. 
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Table  3.7    Some  useful  quantities  used  in  the  construction  of  the 
exact  tests  for  the  variance  components  utilizing  the 
methodology  developed  in  Section  3.3.3 


Quantity 


Formula   cited 


Value 


c 
n 

be 
b. 


'»4 
«4 

«6 


(3.189) 

(3.189) 

(3.190) 

(3.192) 

(3.193) 

Table   3.3 

Table   3.3 

Table  3.3 

Table  3.3 
Khuri(1990) 
Khuri(1990) 
Khuri(1990) 
Khuri(1990) 
m4=:  rank(P4) 
m5  =  rank(P5) 
nig  =  rank(Pg) 
m^  =  rank(P7) 
(3.217)(or  Table  3.3) 
(3.217)(or  Table  3.3) 
(3.217)(or  Table  3.3) 


7 

4 

(i,j,k) 

18 

40 

6 

3 

2 

1 

J6®(l3-J3) 

(12-^2)^^3^(13-^3) 
J2®(l3-J3)®(l3-J3) 

(Il-J2)®(l3-J3)«)(l3-J3) 

2 
2 

4 
4 


6(tI  +  3al  +  2al  +  <t? 


2<Tg  +  CTj 
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Table  3.7  —  continued 


Quantity 


Formula  cited 


Value 


h 

(3.217)(or  Table  3.3) 

J/ri 

(3.232) 

•/r? 

(3.232) 

'^r ,  max 

(3.237) 

SSr-4  =  ^4^4 

(3.124) 

SSrs  =  w^wg 

(3.124) 

SSre  =  vWt 

(3.124) 

SSj,y   =   ^7^7 

(3.124) 

^^^r2 

(3.235) 

0-7 

12 
10 
5/6 

87.816 

2.885 

134.599 

26.825 

31.5 


Table  3.8    Expected  mean  square  values  for  the  cloth  example 


Source 


C 

A*C 
B*C 

A*B*C 
"Error' 


Mean  Square 


MSr4  =  SSr4/2 
MSr5  =  SS^5/2 
MSre  =  SSj.6/4 
MSj,7  =  SSj,7/4 
MSEj,2  =  SSEr.2/10 


Expected  Value 


6<xl  +  3<Tl  +  2al  +  a^  +  ^(T\ 


6"e 


3cr^  +  (T7  -f  g<7j 

_2    ,5^2 


Lr2 

6^£ 


Table  3.9 


Source 


c 

B*C 
A*B*C 
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Results  from  the  analysis  of  the  variance  components 
for  the  cloth  example 


F-value 


P-value 


(exact  test  does  not  exist) 
MSr5/MSr7  =  0.215 
MSr6/MSi-7  =  5.018 
fCMSr-y/MSEj-j)  =2.555 


0.8152 
0 . 0737 
0.1044 


Choosing  L'=(l,0i7),  |=0,  and  recalling  Table  3.3,  we  see  that  to 
test  Ho:(/i  +  a  +  ^+(c^))  =0  we  form  (see  (3.286)) 

_SSHo/l 

Hq-  ms^4  ' 

where,  recalling  (3.265)  and  (3.268), 


MS.  =y  Vy.  72  =  49.204. 


In  this  case,  then,  we  find  F„  =366.2  with  p-value  =  0.0027  (based  on 

"0 

1  and  2  degrees  of  freedom) . 

To  test  EQ:a^  +  {aJ3)^^  =a2+(aj3)2.    we  would  take  L' =  (1  jOg,  -  1  jOg) 
and  i=0.      The  appropriate  test  statistic  would  be 


^   _SSHi/l 
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Table  3.10   Some  key  quantities  used  in  the  construction  of  the 
exact  tests  for  the  fixed-effects  utilizing  the 
methodology  developed  in  Section  3.3.4 


Quantity 


Formula  cited 


Value 


Do 

m, 

Do 


"3 

''fl 

A 


f  ,max 


SSHp 


SSH, 


SSH, 


SSH. 


Table  3.3 

Table  3.3 

Table  3.3 

Table  3.3 
Khuri(1990) 
Khuri(1990) 
Khuri(1990) 
Khuri(1990) 
mQ  =  rank(Po) 
mj  =  rank(Pj) 
mj  =  rank(P2) 
mg  =  rank(P3) 
(3.217)(or  Table  3.3) 
(3.217)(or  Table  3.3) 
(3.217)(or  Table  3.3) 
(3.217)(or  Table  3.3) 

(3.264) 

(3.265) 

(3.283) 

(3.283) 

(3.283) 

(3.283) 


18 
9 
6 
3 

(I2-J2)®J9 
J2®(l3-J3)«)J3 

(I2-J2)®(I3-J3)®J3 
1 

1 

2 
2 


6al  +  Sal  +  lal  +  a} 
3^2  +  ^2 

2(Tg  +  (Tj 


18 
1 

18018.570 

5.701 

113.492 

39 . 666 
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Table  3.11   Summary  of  results  for  the  cloth  example 


Source 


Test  Statistic 


p-value 


mean 

A 

B 
A*C 

C 
A*C 
B*C 
A*B*C 


SSHo/MS_f_j  =  49.204 
SSHi/MS^5  =  8.838 
SSH2/2MS^g  =  1.479 
SSH3/2MS^7  =  2020 

f1 =  1 . 547 
MSr5/MSj,7  =  0.215 
MSr6/MSr7  =  5.108 
|MSr7/MSEj.2  =  2.555 


0 . 0027 
0.0970 
0.3305 
0.2476 
0.3556 
0.8152 
0 . 0737 
0.1044 


t Based  on  Satterthwaite' s  procedure. 


where 


MS^5  =  y'f5y^5/2  =  0.645. 

Thus  Fg  =8.838  with  a  p-value  of  0.0970.   The  degrees  of  freedom 
associated  with  this  test  were  1  and  2. 

Similarly  to  test  Hq:/?j  +  (a/?)  .  j  = /?2  +  (<*/^)  .2  — /^3+ C'*^)  .3  ^^  take 
for  our  L  matrix 


L  = 


(1,0^,-1, Oil)' 
(l,Oli,-l,0^) 


(rank(L)  =2) 
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Also  i  =  (0,0)'.       In  the  present  case  the  value  of  the  test  statistic 


IS 


SSH2/2 


with  a  significance  level  (based  on  2  and  4  degrees  of  freedom)  of 
0.3305. 

Finally,  to  test  Hq:  ((^)  .  j  -  (c^)  .  _  -  (a/3)     ■  + (cT/S)  _   =0  for  all  i 

and  j ,  we  form 

SSHo/2 
Fr  =  Mg    =2.020 

with  a  p-value  of  0.2476  (based  on  2  and  4  degrees  of  freedom). 
Table  3.11  summarizes  our  results. 

Performance  of  machines.   The  data  in  Table  3.12  will  be  used  to 
illustrate  the  methodology  developed  in  Section  3.3.7.   These  data 
represent  performance  values  for  three  machines.   Two  power  settings 
(Factor  A),  three  machines  (Factor  B) ,  and  three  stations  (Factor  C) 
were  selected  and  either  two  or  three  runs  were  made  under  each 
setting  (see  Montgomery  (1984,  pp.  377-378)].   Ve  assume  that  Factor  A 
is  fixed  while  Factors  B  and  C  are  random.   The  appropriate  model  is 
given  in  (3.260). 

The  development  of  tests  for  the  variance  components  follows 
similarly  to  that  given  in  the  previous  example  and  thus  we  only 
summarize  our  results.   This  summary  can  be  found  in  Table  3.13.   We 
note  that  to  test  for  a  significant  B  or  C  effect  we  must  use 
Satterthwaite's  procedure  (see  Table  3.4). 
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Table  3.12   Performance  values  for  three  machines 


Machine 


Station 12    3 1    2    3 12    3 

34.1  33.7  36.2      32.1  33.1  32.8  32.9  33.8  33.6 

Power        30.3  34.9  37.1      33.5  33.9  35.1  33.1  32.8  32.8 
Setting  1    31.6  35.0           34.0      34.3  31.7 

Power        24.3  28.1  25.7      24.1  26.0  27.1  24.2  27.4  24.7 

Setting  2    26.3  28.6  26.1      25.1  27.1  23.9  26.1  28.0  22.0 

24.9      27.9  23.9  25.3      24.8 

(source:   Montgomery  (1984,  pp.  377-378)1 


Ve  next  consider  the  fixed-effects.   Recalling  Table  3.4, 
Theorem  3.10,  (3.318),  and  (3.321),  we  see  that  no  exact  test  exists 
(based  on  our  proposed  procedures)  for  testing  BqIO-^—  02-      In  this 
example,  the  derived  model  is  model  (3.260)  without  the  "f  j  ^i^g"  term. 
Thus,  the  fixed  subscripts  consist  of  just  i,  the  nonnested  random 
subscripts  consist  of  j  and  k,  and  there  are  no  nested  random 
subscripts.   Consequently,  Sj  =  1 ,  S2  =  2,  83  =  0,  s-l=3.   We  also  have 
3,^  =  2,    a2  =  a3  =  3.   Furthermore, 


s-1 


n   a:=a,a,  =  9  (see   (3.329)) 


i=Si+l 
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Table  3. 

,13 

Results 

from  the 

analysis  of  the 

variance  components 

for 

the 

machine  example 

Source 

df 

MS 

F-value 

p-value 

B 

2 

14.023 

2.2209^ 

0.6889 

A*B 

2 

11.801 

0.7582 

0.5256 

C 

2 

18.629 

0.7552^ 

0.6195 

A*C 

2 

30.155 

1.9375 

0.2580 

B*C 

4 

10.077 

0 . 6475 

0.6580 

A*B*C 

4 

15.564 

8.9481 

0 . 0024 

"Error" 

10 

1.739 

'Based  on  Satterthwaite' s  Procedure 


d2  =  l 


(see  (3.330)] 


min  a-  =  3 

Sj+1  <  i  <  Sj+Sj 


(see    (3.371) 


Ao  =  {?^:?^  =  qi2'     i<q<3} 


(see    (3.370) 


%  =  iis;    ^1  =  I2®l9 


(see    (3.334)). 


The  li^  vectors  (see  (3.338))  are  restricted  to  £  =  2,3  and  to  ig  =  q 
(q  =  1,2,3).   Therefore,  these  vectors  are  (since  a2=a3  =  3) 


1^=(1,0,0)',  l2=(0,l,0)',  il=(o,o,iy. 
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Table  3.14   Some  useful  quantities  needed  in  the  development  of  the 
exact  confidence  intervals  for  the  machine  example 


Quantity 

if=(2,2)' 
t/=(3,3)' 

(^=(1,1)' 
t/=(2,2)' 

i/€Ao 

?^eAo 

Nn 


Formula  cited 


Ho^ 


Ho^ 


Ho^ 


Hr 


H 


nr 


pr 


'00 


N, 


01 


'10 


ni 


N, 


Co* 


Cj 


Bi 

dlC*Nig 


;3.338) 
:3.338) 
;3.338) 
;3.338) 
;3.338) 
;3.338) 
;3.347) 
:3.347) 
:3.348) 
^3.348) 
;3.351) 
;3.351) 
:3.351) 
:3.351) 
;3.353) 
;3.353) 
;3.356) 
:3.356) 
[3.366) 
;3.366) 
Theorem  3.11 
Theorem  3.11 


Value 


I2®  I3®  I3 


IjOlg®!! 


I2®  ii^U 


I2®  13®13 


Ij®  I30I3 


H^Hf/9-HX7l8 
QPgQ' 
QPrQ' 

I2/9 

J2/I8 

0 

I2/9-J2/18 

(2l2:J2)/18 
(0:2l2-J2)/18 

(1,0) 

(1,0) 

J2/I8 
I2/9-J2/I8 
ifi  +  a)/9 

(«i-«2)/18 


Table  3.14  -  continued. 
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Quantitv 
djCoBoCg 
d2Cj  BjCj 


Formula  cited 
Theorem  3.11 
Theorem  3.11 
(3.375) 
(3.375) 
(3.374) 
(3.374) 


Value 

1/18 

1/18 

3.5281 

0.7491 

0.5185 

0.6192 


Some  key  quantities  used  in  the  development  of  the  confidence 
intervals  given  in  (3.379)  are  listed  in  Table  3.14.   In  this  case, 
the  space  of  estimable  functions  of  the  fixed-effects  parameters  is 
spanned  by  CoNog  and  C*Njg,  or  equivalently  by 

ft  +  a,    Oj  -  ttj   (see  Table  3.14) 

where  a=(aj+a2)/2.   In  Table  3.15  we  provide  exact  957o  confidence 
intervals  for  fi  +  a   and  Oj— aj  along  with  approximate  ones  based  on 
Satterthwaite' s  procedure.   To  apply  Satterthwaite' s  procedure  we 
note  that  (see  Lemma  3.10  and  Theorem  3.10) 


u,o~N(>fT8(/i  +  5),  6o  +  t) 
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Table  3.15   Exact  and  approximate  957.  confidence  intervals  for  the 
machine  example 


Estimable  function 


fi  +  a 


"l-"2 


Exact 
confidence  interval 

(20.2696,  43.1367) 

(-11.3968,  38.3637) 


Approximate 
confidence  interval 

(24.9762,  36.3594) 

(-31.8117,  12.9087) 


Wi~N(^(Ql-a2),  <5i+^) 

where  ^q  and  ($j  are  given  in  Table  3.4.   Thus,  an  unbiased  estimator  of 

fi  +  a   is  Wq/-^T8  and  an  unbiased  estimator  of  a^—a2   is  2w^/^18.   From 

2  2 

Table  3.4  we  see  that  unbiased  estimates  of  (6q  +  ^)/18   and  2((5j -|--^)/9 

are,  respectively, 

i(MS2  +  MS4-MS6)  =  1.2541 
|(MS3  +  MS5-MS7)  =5.8648. 


As  a  final  remark  we  note  that  since  there  is  a  large  A*B*C 
interaction,  any  conclusions  which  one  reaches  in  regards  to  the 
other  effects  in  the  model  must  be  done  so  cautiously. 
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3.3.9   Concluding  Remarks 

In  this  section  we  analyzed  the  general  unbalanced  mixed  model 
with  imbalance  affecting  the  last  stage  only.   We  approached  the 
problem  of  obtaining  exact  tests  for  the  variance  components  and  for 
estimable  linear  functions  of  the  fixed-effects  parameters  in  two 
different  ways.   In  Subsections  3.3.3  and  3.3.4  we  utilized 
resampling  to  first  construct  exact  tests  for  the  variance  components 
and  then  to  develop  exact  and  approximate  tests  for  the  fixed-effects 
parameters.   With  the  exception  of  the  exact  test  for  a^,  our  exact 
tests  for  the  variance  components  and  the  fixed-effects  (when  they 
exist)  are  Vald's  tests.   This  is  true  since  the  degrees  of  freedom 
associated  with  these  tests  match  those  of  the  corresponding  tests 
from  the  balanced  data  situation.   A  drawback  to  this  approach  was 
the  need  to  use  two  different  transformations  to  arrive  at  our  exact 
procedures.   An  interesting  side  note  to  our  methodology  was  the  use 
of  resampling  as  a  precursor  to  utilizing  Satterthwaite' s  procedure. 
Instead  of  (wrongly)  applying  Satterthwaite 's  methodology  to  sums  of 
squares  which  were  not  independent  nor  distributed  as  scaled  chi- 
squared  variates,  we  first  used  our  resampling  scheme  to  transform 
the  model  into  one  in  which  the  required  assumptions  were  met,  and 
then  (correctly)  applied  Satterthwaite' s  procedure. 

In  Subsection  3.3.7  only  one  transformation  was  needed  to  arrive 
at  a  set  of  sums  of  squares  which  were  independent  and  distributed  as 
scaled  chi -squared  variates.   These  sums  of  squares  mimicked  their 
balanced  counterparts.   However,  the  resulting  exact  tests  were  seen 
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to  be  less  powerful  (in  general)  than  the  exact  tests  developed  in 
the  two  previously  mentioned  subsections.   This  led  us  to  recommend 
the  use  of  those  tests  when  most  of  the  cell  counts  are  much  larger 
than  the  minimum  cell  frequency. 

Because  the  degrees  of  freedom  associated  with  the  exact  tests 

for  the  variance  components  o-^_p+i  5<^^_p+2' '  '  " ''^^-l '  ^"^  '^°^   those 
fixed-effects  parameters  in  which  there  existed  a  proper  mean  square 
that  could  serve  as  an  "error"  term  in  an  F-ratio,  match  the  degrees 
of  freedom  associated  with  the  corresponding  exact  tests  in  a 
balanced  mixed  model,  we  can  conclude  that  the  exact  tests  developed 
in  Subsection  3.3.7  are  also  Vald's  tests.   In  cases  where  no  error 
term  existed,  we  utilized  the  results  in  Khuri  (1984)  and  constructed 
exact  simultaneous  confidence  intervals  on  linear  functions  of  the 
fixed-effects  parameters. 

Finally,  although  not  pursued  in  this  dissertation,  it  would  be 
very  simple  to  construct  simultaneous  confidence  intervals  for 
continuous  functions  of  the  variance  components  by  employing  the 
methodology  presented  in  Khuri  (1981).   This  may  be  preferable  in 
those  situations  where  the  only  alternative  is  to  use  Satterthwaite' s 
procedure  (or  some  other  approximate  procedure)  since  the  exact 
converage  probabilities  associated  with  these  (approximate)  intervals 
are  typically  unknown. 
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3.4  The  Random  Unbalanced  Nested  Model  with  Imbalance 
Affecting  the  Last  Two  Stages  Only 


3.4.1    Introduction 

In  this  section  we  extend  the  results  of  Khuri  (1987)  by 
considering  the  random  s-fold  unbalanced  nested  model  with  imbalance 
affecting  the  last  two  stages  only.   Ve  again  utilize  resampling  to 
make  transformations  in  a  "step-down"  manner  to  arrive  at  our  exact 
tests.   This  approach  is  similar  to  the  approach  employed  in  Section 
3.2.   Ve  assume  all  cells  contain  at  least  one  observation. 

After  some  preliminary  development  we  proceed  to  the 
construction  of  the  exact  tests  for  the  variance  components  in  the 
model.   Furthermore,  we  show  that  our  tests  reduce  to  the  usual  ANOVA 
tests  when  the  design  is  balanced.   Also,  we  develop  expressions  for 
the  power  associated  with  the  tests.   A  numerical  example  is  given  to 
illustrate  our  proposed  methodology. 

3.4.2.   Notations  and  Preliminary  Development 

In  our  development  we  will  concentrate  on  the  following  model 
(the  notation  being  similar  to  that  of  the  previous  section), 


ye^^,  =  '^+y,+%-^---+ys^%^. 


(3.380) 
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where   the   sets   6-   are   defined  as 


?j  =  (ki,k2,---,kj),    1<  j<s+l  (3.381) 


and  (since  the  design  is  balanced  except  for  its  last  two  stages) 

/  1,2, •• •  ,aj,   j  =  1,2, • •  •  ,s-l 

k.=-^  l,2,...,m.   ,   j  =  s  (3.382) 

-J   I  s-1 

1,2,  ■  ■  ■  ,nfj    ,      j  =  s+1. 
"s 

In  (3.381),  9-   represents  the  subscripts  associated  with  the  j 


effect  in  model  (3.380),  j  =0,1,  •  •  •  ,s+l;  (when  j  =0,  ^i=<^   and 

y  ,  =  u)  .      Furthermore  we  will  assume  that  T/i  ,t<i  ,  •  •  •  ,1fa    ,  and  €a  are 

2  "s  ''s+l 

independently  distributed  with   7^    ~N(0,    o"^-),    l<j<s;    and 


j 

eg        ~N(0,  <rl). 
s+1 

In  matrix  notation  (3.380)  becomes 


y  =  /iln  +  Xi^i+---+Xs^s  +  e,  (3.383) 


where  y  is  the  vector  of  observations;  €  is  the  vector  of  errors;  /?  • 
is  a  random  vector  consisting  of  the  elements  of  7^   (1<  j<s);  and 
X-  is  a  matrix  of  zeros  and  ones  (l<j<s)  written  as 


X.=  eln^  ,  l<j<s  (3.384) 


where 


n=  En.  (3.385) 

a      ■'s 
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H    =       En.,    l<j<s-l,  (3.386) 


where  ^s  ~  ^  i  =  (^  i  +  l '^  i+2' '  '  ' ''^s)  •      ^^^   "^  also  define 


wh 


m=E  nifl  (3.387) 

^s-1     -1 


m.    =  E        mfl        .    l<J<s-2,  (3.388) 

ere   ^s-1 ~^ i ~  C*  i+l '^ i+2' "  '  " '^s-l) "      ^^^   matrix   X-    is   of   order 


nxc-,    l<j<s,    where 


J 
'.Ha.,    j  =  l,2,-  •  •  ,s-l 
1=1    J 


c.=<--  .  (3.389) 

J       I     m,  J  =  s. 


The  assumptions  made  earlier  concerning  the  random  effects  can  now  be 
restated  as: 

(i)   /?i,/?2>  ■  •  •  5/^s'  ^"^  i   ^^^   independent  and  normally  distributed 
random  vectors  with  zero  mean  vectors  and  variance- 
covariance  matrices  given  by 

Var(^.)=<T|lc.,  i  =  l,2,---,s 

(3.390) 

Var(e)=<^^In- 
From  (3.383)  it  is  thus  easy  to  verify  that 
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E(y)=/iln 

(3.391) 

,2  Y  y'  j^  ,»2t 


Q  =  Var(y)  =  E^iXiX'i+<T^In. 
i=l 


Besides  the  usual  assumptions  concerning  the  random  effects  we 
will  also  assume: 

(i)   The  model  used  is  model  (3.380)  (or  model  (3.383)). 

(ii)   n>2m-l.  (3.392) 

(iii)  m>2Cg_j-l.  (3.393) 

The  last  two  assumptions  are  needed  for  technical  reasons  to  insure 

the  validity  of  our  methodology.   Neither  is  overly  restrictive.   For 

example,  (3.391)  will  hold  if  n^  >2  for  all  kj,k2,---,ks  while 

''s 

(3.392)  will  hold  if,  for  example,  m^    >2  for  all  k^  ,k2,  •  •  •  ,kg_-^ . 

s-1 


3.4.3   An  Exact  Test  for  (t| 


Introduction.   In  this  section  we  develop  an  exact  test  for  the 
hypothesis  HQ:<r|  =  0  vs  E^iOTg^O.      This  test  is  similar  to  the 
corresponding  test  in  a  balanced  model  in  that  it  compares  the  sum  of 
squares  associated  with  the  jg      effect  to  the  error  sura  of  squares  in 
model  (3.380). 

Preliminary  development.   To  begin  with  let 


Vj  =  [ln  =  Xi:---:Xj],   l<j<s.  (3.394) 
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Then,   momentarily  treating  /?j  ,/?25  •  •  •  ,^s  ^^  fixed,  we  define  the  sum 
of  squares  Ry(^s I ^1 '  '  "  " '^s-l)  ^® 

Ry(^sl^l.  •  •  •  .^s-l)  =y'Cln- Vl(Vs-lVs-l)"V's_i]y 

-y'[In-Ws(W'sVs)~V'3]y.  (3.395) 

Since 

range(ln) C  range(Xj)  C  ■  •  •  C  range(Xs) ,  (3.396) 

we  can  express  (3.395)  as 

RyC^sl^i.  •  •  •  '^s-l)  =y'CPs-Ps-l]y'  (3.397) 


where 


Pj=Xj(X'jXj)-iX'j,   j  =  l,2,---,s.  (3.398) 


(From  (3.384)  it  is  clear  that  X-  has  full  column  rank  c-, 

j  =  1,2,  •  •  •  ,s-l ;  and  Xg  has  full  column  rank  m.j  Ryd^s  I '^i '  '  '  ' '^s-l^ 

can  be  interpreted  as  the  sum  of  squares  for  ^g  adjusted  for 

/?j,  .  .  •  ,/3g_j  (see   Searle  (1971)  for  a  description  of  the  R(  •  |  •  ) 

notation]. 

The  error  sum  of  squares  associated  with  model  (3.380)  (or  model 
(3.383))  is  given  by 

SSEy  =  y'(In-Vg(V'gVg)-V'g)y 
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=  y'(In-P8)y-  (3.399) 

The  last  equality  follows  from  (3.397).   The  following  theorem 
provides  the  necessary  tools  to  construct  our  exact  test. 

Theorem  3.12 

(a)   RyC^sl^i' •  •  • '^s-l)  ^^  independent  of  SSEy  and,  under  HQ:(r|  =  0, 
is  distributed  as  cr^X^-c 


's-1 


(b)      SSEy/<r2~x^ 


2  _  ,,2 

-ra* 


Proof: 


(a)   We  first  verify  that  Ry(^s I ^i '  •  '  • >^s-l)  ^^  independent  of  SSEy. 
To  verify  this  we  must  show  that 

(Ps-Ps_i)"(ln-Ps)=0.  (3.400) 

In  order  to  show  (3.400)  we  need  the  following  easily  derived 
results: 

(i)   X.X'.Pj  =  X.X'.  =PjX.X'.,   l<i<j<s  (3.401) 

(ii)  .  Pj  =  Pj'   j  =  l,2,---,s.  (3.402) 
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(iii)   PjPi=Pj  =  PiPj,      l<i<j<s.  (3.403) 

For  example  to  show  (3.401)  note  that  range(XjX'^)  C  range(Pj)  for 
i<  j  and  equals  range(P  •)  for  i=j.   Thus  there  exists  a  matrix 
X|  •  such  that 


Vi=Pj^ij 


=  X^'-P-      (by  symmetry) 


Consequently, 


X-X'^P- =X|'.P.   (by  (3.402)  which  clearly  holds) 


=  XiX'i 


=  PjX^X'^  (by  symmetry). 
Formula  (3.403)  can  be  shown  in  a  similar  fashion. 
Now,  (3.401)  implies  (see  (3.391)) 

n(I„-Ps)  =  (In-P3)<T2  (3.404) 

while  (3.402)  and  (3.403)  imply 
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(Ps-Ps-l)(In-Ps)=0- 


(3.405) 


Thus  (3.400)  is  verified  and  consequently  SSEy  is  independent  of 

Ve  now  show  that  Ry(^s  l^l»  '  '  ' '^s-l)/<^€  ~  ^m-Cg_^  «hen  4  =  0- 
To  do  so  we  need  to  verify  that  when  o-|  =  0  (Ps -Pg-l^^/'^f  ^'^ 
idempotent;  and  rank(Ps  -  Pg.^)  =  m- c^j .   Now,  from  (3.391)  and 
(3.401)  we  have 


(Ps-Ps-l)"=(Ps-Ps-l) 


i  =  l 


=(.fi/iXin+-^Ps)-Ci:/lXiX'i 
^1=1  1=1 

+  4Ps-iXsXs  +  <^|Ps-i) 


^XsX's  -  ^Ps-l^sX's  +  <^e(Ps  -  Ps-1 ) 


cr|P3X3X'3-4P3_iX3X'3  +  <72(Pg-P3_l)(see    (3.401)) 


=  (Ps-Ps-l)[4XsX's  +  '^^l]- 


(3.406) 


Thus  when  Hq:<t|  =  0  holds,  (Pg "  Ps-l)^/'^f  becomes 


(Ps-Ps-l)"/'^e  =  Ps-Ps-r 
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Since  Pg-Pg-l  is  idempotent  (from  (3.402)  and  (3.403))  and 
clearly  has  rank  f  — c   ..,  it  follows  that  when  HQ:CTg  =  0  is  true 


2    „2 


Ry(^sl^l.---'^s-l)/'^e~^m-Cg_i 


(b)      Recalling  from   (3.404)    that 

fi(In-Ps)/<T|  =  In-Ps 

and  using  the  fact  that  In~Ps  ^^  idempotent  and  has  rank  n— m, 
it  follows  from  (3.399)  that 


SSEy/a^  ~  Xi 


0 


The  exact  test.   From  Theorem  3.12,  a  test  statistic  for  testing 
HQ:cr|  =  0  vs  Hg^icrl^O  is  given  by 


Fs- 


/[Ps-Ps-i]y/("'-s-i) 

SSEy/(n-m) 


(3.407) 


Under  Hq,  Fs ~ ^m-c    ,n-m.   Under  Hg^  the  distribution  of  Fg  i 
complicated.   However,  we  still  have 


IS 


y'(Ps-Ps-i)y 


'S-l 


=  iiI3^tr[(P3-P^_l)Q]  +  /iVn(P3-Pg_i)lJ.     (3.408) 


Now,  since  range(ljj)  C  range(P  ■)  and  P-  is  a  projection  matrix  (recall 
(3.398)]  we  have  that 
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'jln^ln.   j  =  l,2,---,s. 


(3.409) 


Thus 


ln(Ps-Ps-l)ln  =  0. 


Hence,  from  (3.406)  and  (3.409), 


tr[(Ps-Pg_l)fi]=<T?(m-c^_l)+4tr[X'3(P3-P3_i)X3] 


„  m 

'^?(m-C3_i)-f4  Ex'sj(Ps-Ps-l)^sj 

1=1  -^  -^ 


where 


•^s  ~  L^si  •  -^82  •  •  ■  ■  •  ^sin  J  ' 


Therefore, 


'y'(Ps-Ps-i)y 


m-c 


s-1 


2  ,   "s 


(^;  + 


e   "^  m-c. 


Y.Ex;j(Ps-Ps-i)x,j><^: 


(3.410) 


Consequently,  we  would  reject  Hq  for  large  values  of  Fg  in  (3.407). 
Note  that  resampling  was  not  needed  to  construct  this  exact  test. 
We  now  develop  an  exact  test  for  crg_-.. 
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3.4.4   An  Exact  Test  for  <r^_-^ 

Introduction.   In  this  section  we  develop  an  exact  test  for  the 
hypothesis  HQ:crg_j^=0  vs.  Ha^:<r^_-|^  ^  0.   In  the  balanced  data  situation, 
we  would  simply  compare,  aside  from  a  constant,  Ry(^s_i I  .^i  > '  '  ' 5^s_2) 
to  RyC^sl^l'-  •  -'^s-l)  (treating  Pi,  ■■■,0s   ^^   fixed)  where 

Ry(^s-ll^l'---'^s-2)=y'tPs-l-Ps-2]y' 

(see  (3.396)  and  (3.398)).   Unfortunately,  when  the  data  are 
unbalanced,  we  can  not,  in  general,  show  that 

(Ps-l-Ps-2)"(Ps-Ps-l)=0 


unless  we  assume  (t|  =  0.   This  follows  since 


(Ps-l-Ps-2)"(Ps-Ps-l)=4(Vl-Ps-2)M's(Ps-Ps-l)' 

(recall  (3.391)  and  use  (3.401)  -  (3.403)V   Thus  these  two  sums  of 
squares  will  not,  in  general,  be  independent  unless  we  (unreasonably) 
assume  <r|  = 0 .   Consequently  a  different  approach  is  necessary  to  test 
this  hypothesis.   We  might  try  comparing  the  sum  of  squares  for  l3g_i 
adjusted  for  all  the  other  effects  in  the  model  (treating  them  as 
fixed)  to  Ry(^sl/'l' ■  ■  • '^s-l)-   Unfortunately,  the  former  sum  of 
squares  can  be  shown  to  zero  (see  (3.396)1. 
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The  idea  we  develop  is  based  on  a  simple  observation  that  was 
illustrated  in  Section  3.4.2: 

For  any  random  nested  model  (balanced  or  unbalanced)  we  can 
construct  an  exact  test  for  the  variance  component 
associated  with  the  penultimate  stage  by  comparing  the  mean 
square  for  the  corresponding  effect  (adjusting  for  all  the 
effects  preceding  it  in  the  model)  to  the  error  mean  square 
for  the  model. 

Thus  to  test  HqIO-I  I  =0  vs.  Uq^:<t^_^^0   we  will  make  a  transformation 
(utilizing  resampling)  that  will  reduce  model  (3.380)  for  (3.383)j  to 
one  that  mimics  a  random  (s-l)-fold  nested  model.   In  the  transformed 
model,  c^_i  will  be  the  variance  component  associated  with  the 
penultimate  stage. 

Preliminary  development.   Define  the  CgXn  matrix  D  as 


D=  ©rr^l'n 


(X'sXs)-'X'3.  (3.411) 


Also,  let 


y  =  Dy.  (3.412) 
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Furthermore,  define 


Aj=©lm-,  .    j  =  l,2,...,s-l.  (3.413) 


The  matrix  A-  is  or  order  mxc-  and  of  full  column  rank  c- 
J  J  J 

(j  =  1,2,  •  •  •  ,s-l).   From  (3.384)  and  (3.396)  we  can  show  that 


1   —  X  1 
in  ~  '^sim 

(3.414) 

Xj=XgAj,    j  =  1,2,  •  •  •  ,8-1. 

Consequently,    (3.412)    can   be  expressed  as 

y  =  /^ira  +  Ai^i+A2^2+  •  ■  •  +As-i^s-l  +  Im^s+I  (3.415) 

where  T  =  De  .      We   note   that 

range(l^) C range(AJ  c  •  •  •  C range(Ag_^) .  (3.416) 

This  follows  since  (writing  Xq  =  1^  and  Aq  =  1„)  ,  for  0<i<j<s-l, 

range(A.)  =  range (XgAj^)       (Xg  has  full  column  rank) 

=  range(Xj)         ^see  (3.414)) 

Crange(Xj)         ^see  (3.396)) 
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=  range(XsA.) 


range(Aj). 


From  (3.415)  it  is  easily  seen  that 


E(y)=/iij 


(3.417) 


E  =  Var(y)  =^j:,T\A.A'.+all^  +  Kal 
i  =  l 


where 


K=(X'sX3)-i=en^i.  (3.418) 

^s  ^ 


Now,  let  Hj  consist  of  the  last  m-1  rows  of  the  m x m  Helmert  matrix 
^see  Searle  (1982,  p.  71)j.   Then  Hj  has  the  following  properties: 

(i)   H,Hl  =  Vi. 

(ii)   HiHi  =  I„-J„.  (3.419) 

^Recall  that  Jni  =  ffilmim')  Define  the  (m-l)xl  random  vector  p 


as 


u  =  Hiy  (3.420) 
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The  purpose  of  this  transformation  is  to  cause  u  to  have  a  zero  mean 
vector  as  illustrated  below  fsee  (3.417)1. 


E(u)=HiE(y)=pHil„  =  0^_l 


Var(u)=E^HiA.H;,7|+4lm-l+"i'^«i'^€. 
i  =  l 


(3.421) 


where  A.=A-A'-,  i=l,2,''',s-l. 

Utilizing  resampling.   Let  R  be  the  nxn  matrix  associated  with 
error  sum  of  squares  in  model  (3.380).   That  is,  from  (3.399), 


R=In-Ps-  (3.422) 

The  matrix  R  is  symmetric,  idempotent,  and  has  rank  n-m  (see  Lemma 
B.4).   Ve  can  write  R  as 

R  =  CAC'  (3.423) 

where  C  is  orthogonal  and  A  is  diagonal.  The  first  n-m  diagonal 
entries  of  A  are  ones  while  the  remaining  m  diagonal  entries  are 
zeros.   Partition  C  and  A  as 

C  =  [Ci:C2:C3J 

(3.424) 
A  =  diag(I^^,I^^,0) 
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where 

r?!  =m-l 

(3.425) 
rjj  =  n-2m+l 

and  CjjCj,  and  C3  are  of  orders  n  x  tJj  ,  nXTjj?  and  nxm,  respectively. 
Note  that  rj^ -I-tJj  =  n-m  =  rank(R)  and  r?2  >  0  by  (3.392). 

Ve  now  introduce  a  transformation  that  will  result  in  a  random 
vector  which  has  a  diagonal  variance-covariance  matrix.   Let  w  be  the 
(m-l)xl  random  vector  defined  as 


1 
V  =  i}  +  (^maxl7?i-L)2C'iy  (4.426) 

where 

L  =  HiKHi  (4.427) 


1 

^^^  ^msix   is  its  largest  eigenvalue.   The  matrix  (^ma.x^r)   -L)2  is  well- 
defined  since  A^ax^7jj~L  is  p.s.d.   It's  eigenvalues  are  the  square 
roots  of  the  eigenvalues  of  ^jaax^r]   "L.   The  next  lemma  provides  some 
distributional  properties  for  y. 

Lemma  3.13 


The  random  vector  w  is  normally  distributed  with 
(a)   E(w)=g 


1  =  1 


(3.428) 
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Proof : 


First  note  that  w  is  clearly  normally  distributed. 

(a)  From  (3.421),  E(u)=u.   From  (3.398),  (3.414),  and  (3.422),  it 
follows  that  R1^  =  0.   Thus,  since  range(Ci)  C  range(R)  ,  it  must 

be  that  C^ljj=:0.   Consequently  the  result  follows  after  recalling 
the  definition  of  w  in  (4.426)  and  that  E(y)=/jljj. 

(b)  Ve  claim  that  u  is  independent  of  Cjy  in  (4.426).   To  see  this 
first  note  that 


DfiR  =  DR(r2       (see  (3.404)) 

=  0.    (see  (3.398),  (3.411),  and  (3.422)) 

Thus,  since  range ( C^ )  C  range (R)  it  follows  that  DQC^  =  0  =4>  y  is 
independent  of  Cjy.   But  this  implies  that  u  and  C'jy  are 
independent  since  u  is  a  function  of  y.   Consequently, 


Var(y)  =  Var(u)  +  (X^e.K\-'^rC[nC,iX^^^I^^-L)^ . 

Now,  from  (3.391),  (3.401),  (3.422),  and  the  fact  that  R  is 
idempotent,  we  have 


RQR  =  Raj . 
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Hence,  from  (3.421), 

1  =  1 
where  *  =  4  +  ^max<^e-  □ 

From  this  lemma,  (3.415),  and  (3.420),   we  can  write  w  as 

y  =  HA^i  +  HiA2^2  +  •  ■  •  +HiAs-l^s-l+^*         (3.429) 

1 
where  £*  =  Hj^g +H1I+ (An,^^!^^  -  L)2c;e  ~  N(0,5I^_-^)  independently  of 

'^i' ■  ■  ■ '^s-l*   Model  (3.429)  resembles  a  random  (s-l)-fold  nested 
model  (with  zero  intercept)  because  of  the  distributional  properties 
of  ^i''""»^s-l'  ^^'^   ^*'  ^"d  since  for  l<i<j<s-l, 


HjAj^  =HiAjA|j   ^for  some  matrix  A|  • ,  see  (3.416)] 


=^range(H^A-)  C range(HiA •) .  (3.430) 
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Before  we  proceed  to  the  development  of  our  exact  test  we  need 
the  following  result. 

Lemma  3.14 


(a)   rank(A'jH;HiAj)=Cj-l,    j  =  1 ,2,  •  •  •  ,s-l , 


(b)   A  g-inverse  of  A'-H^H^A  •  is  given  by 


(A'jHjHjAj)-  =  (A'jAj)-i,   j  =  1,2,  .  .  .  ,s-l, 


Proof : 


(a)       rank(A'.H;HiA.)  =rank(HiA.) 


J  ^^  J'   ^  1  J^ 


<  rank(A  •) =  c  • . 


However,  from  (3.413)  Ajl^  .  =  !„,.   Since  ^ilm  =  Qm_i   (see 


J 


(3.419)(iii)j,  the  columns  of  H^A •  are  linearly  dependent.   Thus 
rank(A'jH'iHiAj)  <Cj-l,   j  =:  1 ,2,  •  •  •  ,s-l .   Conversely,  by  the 
Frobenius  Inequality  (see  Lemma  A.l),  we  have 


rank(HiA.)  >  rank(Hi) -|-rank(A  •)  -  rank(I„) 

J  J 


m  —  1  +  c . — m 
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=  Cj-l,    j  =  1,2,  •  •  •  ,s-l, 


Thus,    rankCA'-HiHiAj)  =Cj-l,    j  =  1 ,2,  •  •  •  ,s-l , 

(b)      First  note   that  (see   (3.413))  for   j  =  1,2,  •  •  •  ,s-l, 

(Aj(A'jAj)-Vj)J„  =  (®4-J,^)j, 

J  J  J 

=  Jm  (3.431) 

=  J„(Aj(A'jAj)-U'j).  (3.432) 

Hence,    for   j  =  1 ,2,  •  •  •  ,s-l 

A'jHlH,A.(A'jAj)-A'jHiH,Aj 

=  A'j(In,-Jm)Aj(A'jAj)-U'j(I„,-J„)Aj   (see   (3.419)) 

=  (A'j-A'jJn,Aj(A'jAj)-i(I„-J„)Aj 

=  (A'j-A'jJ„)(I„-Jn,)Aj  (see    (3.431)    and    (3.432)) 

=  Aj(Im-Jm)Aj  (Im-Jra   is    idempotent) 

=  A'jHiHiAj.  D 
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We  now  develop  the  exact  test  for  <''s_i' 

The  exact  test.   Let  us  define  the  (m-l)x(in-l)  matrix  Q- 


(j  =  1,2, •  •  •  ,s-l)  as 


Qj=HiAj(A'jHlHiAjrHi 


=  HiAj(A'jAj)-iA'jHi.  (4.433) 

Furthermore  let  ^i^il^      -i]  P-^,  •  •  •  ,P      2)    ^^  *^®  ^"^  °^    squares  for  the 

/?  _.  effect  adjusting  for  /?i  5  "  "  '  j/'g.o  (treating  them  as  fixed)  for 

the  w  model  defined  in  (3.429).   Also  let  SSE^^  be  the  error  sum  of 

squares  for  model  (3.429).   Then 

V^s-ll^i'---'^s-2)=y'[Vl-Vs-2(V's-2V2rn-2]'^ 

-'^'[Vl-Vs-l(V's.iV3_irv'3_i]- 
and 


ssEy  =  .,'(Vi-v^_i(v'3_iV3_i)-v;_i)y 


where 


\.  =  [HiAjiHiAj:  •  •  •  :HiAj],  j  =  1,2,  •  •  •  ,s-l.        (3.434) 


From   (3.430),    Lemma  3.14(b),    and    (3.433),    ^uO^s-l^^l'  '  '  '  ■>^s-2^   ^"** 
SSE^^   can    be   equivalently   expressed   as 
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Ry(^s-ll^i'---'^s-2)=y'  [Qs-l-Qs-2]y  (3-435) 

SSEy  =  y'[Vl-Q3_l]y.  (3.436) 

The  major  result  of  this  section  is  given  in  the  next  Theorem  which 
provides  some  distributional  properties  for  Rj^(/?  _i  | /^i ,  •  •  •  ,/?  _o)    and 
SSEy. 

Theorem  3.13 

(a)  J^w(^s-ll^i' ■  •  ■ '^s-2)  ^^  independent  of  SSE^  and,  under 

^^n"'''c  1  =0>  is  distributed  as  Syi 
"   ^"^  ^s-l~^s-2 

(b)  SSEy/(5~Xm-c   ..  »  ('^  ^as  given  in  Lemma  3.13(b)y 
Proof: 

(a)   We  first  verify  that  RyC^s-l I ^l '  '  '  ' '^s-2)  ^"^  ^SE^  are 
independent  by  showing 

[«s-l-Qs-2]CVar(y)][I^_l-q^_^]  =  0. 

In  order  to  show  this  we  need  the  following  results,  which  are 
analogous  to  (3.401)  -  (3.403) . 

(i)   HiA.A'.Hiqj=HiA.A'.Hi  =  q.H,AjA'.Hi,   1  <  i  <  j  <  s-1    (3.437) 
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(ii)   Q^j  =  Qj,   j  =  l,2,---,s-l  (3.438) 

(iii)  q^q.  =Qj  =  Q.Qj,    l<i<j<s-l  (3.439) 

Formula  (3.438)  is  obviously  true.   To  show  (3.437)  note  that 
for  1  <  i  <  j  <  s-1 

range(HiA.A'.H;)  =range(HiA.) 

Crange(HiAj)  (see    (3.430)) 

=  range(qj).  (see    (3.433)) 


Thus  there  exists  a  matrix  B*  •  such  that 


HiVi«i  =  QjBij 


=  Bj.q..       (by  symmetry) 


Hence,  for  1  < i  < j < s-1 


=  Bl'.il.  (by  (3.438)) 


=  E,X.A'.E[ 
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=  Q jHjA. A'.H'i      (by  symmetry). 

Formula  (3.439)  follows  similarly.   Now,  from  Lemma  3.13(b),  we 
have 


Var(^)[V,-Q3_,]=(^f:\A.HXi+5Vj(I^_^-Q^_^) 


1  =  1 


s-1  . 


=  '^(Im-l-Qs-l)     («ee  (3.437)).   (3.440) 


Thus 


"s-l-Qs-2][V^^(y)][Im-l-Qs-l] 

=  *(Qs-l-«s-2)(Im-l-Qs-l) 
=  0         (by  (3.349)). 

Consequently,  \(^s_i  I  ^i ,  ■  •  •  ,^3,2)  and  SSE^^  are  independent. 

To  show  that  \(^s-l  l^l '  "  '  '  '^s-2)/'^  ^^^  ^  chi-squared 
distribution  with  Cg_j-Cg_2  degrees  of  freedom  under  Hq  :  cr^   =  0 , 
we  need  to  verify  that  (qg_  j  -  Qg_2)  (Var(w)  )/($  is  idempotent  (when 

'^s-l-^)  ^""^  '^s-l~^s-2  ^^^  ^^"^  ^s-l~'^s-2"  ^^^   ^^""^  L^ 
3.13(b)  and  (3.437),  we  have 


jemma 
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(Qs-l-qs-2) 


1  =  1 


s-1  . 


s-2  . 


.EH,AiH;c.|-fiQ^_l-(_EH,AiHi.?-f.4_,Q^_2H,A3_iH;  +  6Q3_2) 


=  '^s-lQs-lMs-lHi-4-lQs-2HiVlHl  +  HQs-l-Qs-2)(«««  (3.437)) 
=  («s-l-Qs-2)[-s-lHiAs-lHi  +  ^Vl]- 
Thus  when  Hq:<t2_^=0  holds,  (Qg_j  -  Qg_2)(Var(w))/5  becomes 

('3s-l-«s-2)(V^^(y))/'5  =  «s-l-V2 

Since  Qs_i-Qs-2  ^^  idempotent  Tfrom  (3.348)  and  (3.349))  and 
clearly  has  rank  c^_^-c^_2   (see  (3.433)  and  Lemma  3.14(a)),  it 
follows  that  when  EQ-.a-'t-.—O   is  true 


\(^s-ll^i'---'^s-2)/^~^c3_i-c 


s-2 


(b)   From  (3.440), 


Var(y)[Vl-Q3_^]=^(I^_j-q^_^) 


Thus  Var(w)[I|jj_^-Qg_^]/5  is  idempotent.   Since  rank(Ijj|_-^  -  Q^.^) 
=  m-l-(Cg_-,^-l)  =m-Cg_j  (recall  (4.333)  and  Lemma  3.14(a)),  it 
follows  that  (see  (3.436)) 


s-1 
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To  test  Hq:(7^_j=0  vs.  Hj^ta^j^O  we  form 


F    -'^'(qs-l-qs-2)^/(^s-l-^s-2) 
s-1         SSEy/(m-Cg_i) 


(3.441) 


Under  Hq,  f's-l  ~  f'cg_j  -  Cg_2,m  -  Cg_i  •   Under  H^,  the  distribution  is 
complicated.   However,  we  still  have 


y'(«s-i-'3s-2)y 


*^s-l~*^s-2 


C3_i-C3_2  tr[(Q^_l-q^_2)(Var(y))] 


^s-l~'=s-2 


tr 


(Q 


•1  - 


s-1 


Qs-2)(EMiHXi+^Vl)  ( 


see  Lemma  3 


•13(b)) 


"^s-l-S-2'^^"s-iqs-lHiAs-l";-4-lQs-2Ms-lH; 


+  ^(«s-l-Qs-2)] 


(see  (3.347)) 


"^-^S-l-'s-2'"[^^s-l-qs-2)HiAs-lHi] 


-^+s_i^-^^.2  M^s-l"l(qs-l-qs-2)HlVl] 


=  .5  + 


2      C   1 
O^t  1       S-1 
s-1      v^  ^+/ 


^s-1  -  ^s-2  S  ^^'-1 '  j'^'^^-1  "  ^s-2)^s-l ,  j 


where  a* 

s 


-l,j-"i^s-l,j'  J  =  1'2,-  •  •,Cg_j,  and 


's-1 


^ps-i,r^ 


s-1, 2=  ■  ■  ■  •^•s-l 


'^s-J' 


Thus  we  reject  Hq  for  large  values  of  F^_^    in  (3.441), 
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9     9  9 

3.4.5.   Exact  Tests  Concerning  <Tj  ,  o-j ,  •  •  ■  ,  <'^g_2 


Introduction .   Recall  that  model  (3.429)  mimics  a  random  un- 
balanced (s-1)— fold  nested  model  where  the  imbalance  affects  the  last 
stage  only.  Thus  we  can  adopt  the  procedures  developed  in  Khuri  (1990) 
to  aid  us  in  the  construction  of  our  exact  tests.   In  that  article, 
Khuri  was  able  to  develop  his  exact  procedures  by  first  averaging 
over  the  last  subscript  and  then  making  a  transformation  that 
resulted  in  a  random  vector  having  a  zero  mean  vector.   However  this 
new  vector  had  a  variance-covariance  structure  that  was  not  diagonal 
(see  Khuri  (1990,  p.  182)).   To  obtain  a  diagonal  variance-covariance 
matrix,  Khuri  made  another  transformation,  this  time  based  on  a 
resampling  scheme.   Thus  these  transformations  effectively  reduced 
the  analysis  of  the  unbalanced  model  to  that  of  a  balanced  one. 

We  can  employ  the  same  ideas  to  our  present  situation.   However, 
some  care  needs  to  be  taken  since  model  (3.429)  is  not  exactly  (in  a 
statistical  modeling  sense)  a  random  unbalanced  nested  model.   This 
is  true  since  the  HjA  •  ( j = 0, 1 , •  •  •  ,  s-1)  are  not  design  matrices. 

Preliminary  development.   Recall  model  (3.429).   Namely, 

v  =  iii^,i!,+  ■  ■  ■  +Ms-iL-i+i*- 

Define  the  c   -|X(m-l)  matrix  D*  as 
D*  =  (A'3_iHiH,A3_i)-A;_iH; 
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=  (As_iAg_;^)-U'g_-^H'i  (see  Lemma  3.14(b)).     (3.442) 

Compare  the  definition  of  D*  to  that  of  D  given  in  (3.411).   The 

matrix  D  was  used  in  (3.412)  to  average  over  the  last  subscript  in 

model  (3.380)  (or  model  (3.383)V   Similarly  to  (3.412),  define  the 

c   1  X 1  random  vector  w  as 
s-1 

y  =  D*w.  (3.443) 


Equation  (3.443)  can  be  viewed  as  "averaging  over  the  last  subscript" 
in  model  (3.429). 

Although  E(w)=0  (see  Lemma  3.13(a)j,  it  is  still  convenient,  for 
mathematical  reasons,  to  make  a  transformation  that  results  in  a 
random  vector  having  a  "zero  mean  vector."   This  transformation  is 
similar  in  spirit  to  that  given  in  (3.420). 

Let  H*  represent  the  last  Cg-j^-l  rows  of  the  c^_^xc^_^   Helmert 
matrix  (see  Searle  (1982,  p. 71)).   Then,  similar  to  (3.419),  H* 
possesses  the  following  properties: 

(i)    HrHr=i,^_^_i. 

(ii)   HrHr  =  Ic   .-Jc   ,•  (3-444) 

s-1     s-1 
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Let  u  be  the  (c^  -j^  — l)xl  random  vector 

u*  =  H*w  (3.445) 

The  next  lemma  establishes  some  statistical  properties  for  u*. 
Lemma  3.15 

The  random  vector  u*  is  normally  distributed  with 


(a)   E(u*)=0^    , 


(b)   Var(u*)=  E\*BiH*Vl+4_lI,  ^.^+Sn(A',_^A,_0-'K , 

1  —  1  s  —  i 


where 


Bi  =  Ic.®Jc., 

i  =  1,2,  •  ■  •  ,8-2  (3.446) 

s-1 

Cj  =.  n  a., 

^  j=i+l  J 


(Note  that  c.c^=Cg_^,  i  =  1 ,2,  •  •  ■  ,s-2,  see  (3.389).j 


(c)   u*  is  independent  of  SSE^^,  where  SSE^^  was  given  in  (3.436) 
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Proof ; 


First  note  that  since  w  is  normally  distributed  (see  Lemma  3.13) 
u   is  normally  distributed  (see  (3.443)1,  and  hence  u*  is  normally 
distributed. 

(a)   From  (3.443),  (3.445),  and  Lemma  3.13(a)  we  have. 


E(u*)  =H*E(w) 


=  H*D*E(w)=0^    -, 


(b)   From  (3.443),  (3.445),  and  Lemma  3.13(b)  we  have 


Var(u*)  =H*D 


1=1 


D*'H*' 


s-1 
=  i;  H*D*HiA.HiD*'H*V^  +  6H*D*D*'H*'. 
i=l 


(3.447) 


Now, 


lu*/ 


(i)    uiD*D*'nr=^i(A's.iK-iyK-iK^^s-i(K-iK-ir'^t 


(see    (3.442)) 


=  «r{(A's-lAs-l)-X-lAs-l(A's-lAs-l)-^ 


-  (^'s-l^s-irX-l3mK-l(K-lK-iy'yi' 


(see    (3.419)). 
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From  (3.413), 


(A's-lAs-l)"'A'3_iJ„  =  (  ©  m^-^   1'     )il„i;. 

"s-l  ^  ^        s-1 


-ll      1' 

s-1 


Hence,  from  (3.444), 


-1a'   "f  _  lu*i     1' 


K(^'s-l^s-iy'^'s-l~^ra  =  iKlc^_^l'^  =  Qc^_^-l-  (3-448) 


Thus, 


H*D*D*'H*'==H*(A;_iA3_i)-%*'.  (3.449) 

(ii)   In  the  first  terra  of  (3.447)  consider  H*D*HiA^ ,  i  =  l,2,---, 
s-1.   We  have,  for  i  =  1 ,2, •  • •  ,s-l , 

HrD%Ai=Hr(A'^_lA^_l)-iA'3_lHiHiAi         (see  (3.442)) 

(see    (3.419)) 
=  Hl*(A's-lAs-l)"X-lAi-  H  (2-^^^)) 


Now,  since  range(Aj^)  C  range(A  •)  ,  1  <  i  <  j  <  s-1  fsee  (3.416)j, 

5  •  of  order  c   ^  x  c • , 
J  =  -■■   J 


J- 

there  exists  matrices  B-  of  order  c   ^xc-,  l<j<s-2,  such  that 

1  ^  -»■   .1 
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^j=As-lBj'      J  =  l,2,..-,s-2.  (3.450) 

Consequently,    for   i  = 1,2, • • • ,s-2, 

=  HfB.. 
That   is 


*   *  (HTB-  ,      i  =1,2,  •  •  •  ,s-2 


Therefore,    from   (3.449),    (3.451),    and    (3.447)    we   have 


Var(u*)  =^Z  H*D*HiA.H'iD*'H*V|  +.5H*D*D*'H*' 
i  =  l 


:'E  HrBiB'.H*V|  +HrHra|_l  +ffl*(A'3_iA^_,)-iH*' 
1=1 


s-2 


EHrBiH>?+I  i-l_i+^H*(A;_,A3_,)-iHr, 

1=1  s-i 


where  B-  =B^B'.,  i  =  1 ,2,  •  •  •  ,s-l .   From  (3.413)  and  the  fact  that 
^s-1  ^^^   full  column  rank,  B-  can  be  (uniquely)  expressed  as 


Bi  =  Ic.®lc.»   j  =  l,2,- • -,8-2  (3.452) 

•^     J    J 
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where 


s-1 

n  . 
J  k=j+l 


^^=.    ^  .^k'   j  =  1)2,  •  •  •  ,s-2. 


Thus, 


B.  =  B.B'.  =  Ic.®Jc.,   j  =  l,2,-  •  -,8-2. 


(c)   Since  u*  =  H*ai,  it  suffices  to  show  that  y  and  SSE^  are 

independent.   From  (3.442),  (3.443),  and  (3.436),  we  have 

(A'3.iA3_i)-V3_iHi[Var(c,)](I„_i-Q3_i) 

=  «(A'3_iA3.i)-X-lH'i(Im-l-qs-l)  H  (3-440)). 
However,  since 

range  (HiAg_^  )=  range  (Qg-,^)        (see  (4.333)j 
it  follows  that 

A's-lHi(Vl-'ls-2)=0- 
Thus  u   (and  hence  u*)  is  independent  of  SSE^,.  D 
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In  the  expression  for  Var(u*)  given  in  Lemma  3.15(b),  the 
matrices  HjB^H*',  i  =;  1 ,2,  •  •  • ,s-2,  are  analogous  to  their  counterparts 
in  a  balanced  model  in  the  sense  that  they  can  be  shown  to  commute  in 
pairs.   This  is  the  focal  point  of  the  next  subsection. 

The  matrices  H*B^H*',  i  =  1 ,2,  •  •  • ,s-2.   In  this  subsection 
we  show  that  the  matrices  H*B^H*',  i  =  1 ,2,  •  •  • ,s-2,  commute  in  pairs 
(see  Lemma  3.16).   To  do  so,  we  first  note  that  for  l<i<j<s-2, 

BiBj  =  (Ic.®Jc.)(Ic.®Jc.)      (see  (3.446)) 


=  (Iai®Ia2®  •  •  •  ®Ia.  ®Ja.^.  ®  "  "  "  ®  ^a  • "  "  "  ®  ^a   -,  ) 

i  i  1      1+1  J  s-1 

(see  (3.389)) 


x(Ia  ®Ia,®  •••®Ia.®Ia.  ,  ®  •  •  •  ®  la  •  ®  Ja  •  ,  ®  •  •  •  ®  ^a   J 
^     •*  1     1  +  1  J     j+1  s-1 


Ic.  ®  Ja.^j®  ■  ■  ■  ^^^j^^j+l-^s..^^®  ■  ■  ■   ®ag_jJa^__^ 


=  Cj(Ic.®Jc.).  (3.453) 

Consequently,  B^B-  is  symmetric  which  implies  that  B-  and  B-  commute. 


Lemma  3.16 


The  matrices  H*Bj|^Hj'  commute  in  pairs  for  i  =  1 ,2,  •  •  •  ,s-2 . 
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Proof ; 


Without   loss   of  generality   take   l<i<j<s-2.      Then, 

(H*B.Hr)(H*BjH*') 

=  HrBi(Ie^_^-Jc^_l)BjHr  (see    (3.444)) 

=  H*B.B.H*'-H*B.Jc     ,B.H*'.  (3.454) 

1    1    J   ^         ^1      s-1    J 

Now,    from   (3.446), 

=  c^(Ic.®Jc.)(Jcj®Jc.)' 

^S-1  1  ^         1 


-_li_J 

"S-i  S-1' 


recall  that  c-c-  =Cg_i,  i  =  1 ,2,  •  •  •  ,s-l  (see  (3.389)  and  (3.446)). 


Consequently, 


H*^iJc._i  =^«*-^c,_1  =^-  ("^^  (^•^^^)) 
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Thus,  the  second  term  in  (3.454)  vanishes  and  hence 

(HtBiHr)(HrBjH*')=HtBiB.Hr. 

Therefore,  since  B^B-  is  symmetric  fsee  (3.452)j,  it  follows  that 
(H*B.H*')(H*B -H*')  is  symmetric  and  consequently  H*Bj^H*'  and  HjBjH*' 
commute. 


D 


This  lemma  indicates  that  there  exists  a  (Cg.i  -  1) x (Cg-i  ~  1) 
orthogonal  matrix  S  which  simultaneously  diagonalizes 

H*BiHi', •  •  •  ,H*B  _2Ht'-  Before  we  apply  this  transformation  let  us  state 
some  further  properties  concerning  the  HjB^Hj  ,  i  = 1 ,2,  •  •  • ,s-2. 

(i)   The  matrices  B^^,  i  =  1 ,2,  •  •  •  ,s-2,  defined  in  (3.452), 
satisfy 

range(Bi)  C  •  •  •  C  range(Bg_2) •         (3.455) 

Thus,  it  follows  that 

range(H*Bi)  C  •  •  •  C  range(H*Bg_2) •       (3.456) 


(ii)   For  i  =  1,2, •  •  •  ,s-2. 


rank(H*B.)=rank(H*B.H*')=c.  -1.       (3.457) 
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To  verify   (3.455)    let   1  <  i  <  j  <  s-2  and   note   that  fsee    (3.389)    and 
(3.446)), 


Hence,    from   (3.452) 


/C 

c- 


^i=R>. 


«i  =  S^K  =  S^i(!j),. 


Ic.®lcj®lc. 


(lc.0lc.®lc.)(lc.®lc.) 

1  J  J  ■*■  ii 


(Ic.®lc.)(Ic.®ic.) 
J  J  ■'■  d. 


=  B.(Ic.«>lc.) 

^        1        J 

c- 


Therefore,    range(B^)  C  range(B  •)  ,    l<i<j<s-2.      Since,    for 
1  <  i  <  j  <  s-2, 

H*B.  =H*Bj(Ie.®lcj), 
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it  follows  that 


range(Hj'B^)  C  rangeCHj'B  •)  ,   1  <  i  <  j  <  s-2. 


To  verify  (3.457)  first  note  that 


^iic.=lcg_^'   i  =  l,2,---,s-2 


"*-S-l"-c^_l-l       ("^^  (3.444)). 


Consequently,  the  c-  columns  of  H*B-  are  linearly  independent 
rank(H*B^)  < c^-1,   i  =  1 ,2, •  •  •  ,s-2.   On  the  other  hand,  by  the 
Frobenius  Inequality  (see  Lemma  A.l),  for  i  =  1 ,2 ,  •  •  ■  , s-2 

rank(H*B.) > rank(H*) + rank(B. )-rank(Ic    ) 


=  Cg_i-l+C.-Cg_i 


=  Ci-l, 


Thus,  rank(H*B.)  =rank(H*B.H*')  =c.-l,  i  =  1 ,2,  •  •  •  ,s-2. 

As  indicated  earlier,  Lemma  3.16  implies  that  there  exists  a 
(Cg_-|^  —  1)  X  (Cg_j  X  1)  orthogonal  matrix  S  which  simultaneously 
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diagonalizes  Il*BjH*',  •  •  •  ,HjB   o^i'-   The  construction  of  S  is  our  focus 
in  the  next  subsection. 

The  construction  of  S.   We  now  describe  how  to  construct  the 
(c  _i  —  1) X (c   1  —  1)  orthogonal  matrix  S.   To  begin  with  consider  the 
matrix  B'- H*'H*B- ,  i  =  1 ,2,  •  •  •  ,s-2.   From  (3.446)  and  (3.452)  and 
recalling  that  c-c-=Cg_j,  i  =  l,2,---,s-2,  we  have 


'.H*'H:B.  =B'.  (Ir     -Jr    )B: 

1  1  1  1  iV  Cg_^  Cg_^/  1 


(Ic.(8)l^.)(Ic   ,-Jc   J(Ic.®lc.) 


=  (Ic.<8)l^.)-^^(Ic.®ls.)(Jc.®Jc.) 


(Ici®lcJ 


(Ic.0l^.)-c-^(Jc.  ®lfe.) 
^1   ^i    ^s-1   ^1    ^1 


(Ic.®ic.) 


((lc.-Jc.)®ie.)(ic.®icJ 


=  Cj(Ic.-Jc.),   i  =  l,2,-  ••,s-2. 


Hence,  a  generalized  inverse  of  B-H^  H^B-  is 


?' n*'H*R  ^--  1 


(B'iHrH*B.)-  =  J^I^.,   i=l 


= 1,2, • • • ,s-2. 


Consequently,  the  matrix  H*B.  (B'.Hj'HjB.  )  B'-H*' =  iH*B.H*'  is  symmetric, 
idempotent,  and  has  rank  c-— 1,  i  =  1 ,2, •  ■  •  , s-2.   Now,  consider  the 
identity 
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I      1  =H*H*'  (see  (3.444)) 


=  Ui+U2+---+U3_i, 


where 


U.  =  JrHrB.H*'-^H*B._iH*',   i  =  1,2,  •  •  •  ,s-l. 

(Ve  take  Cq  =  c^_-^;    Bq^O;  Cg_j  =  l;  Bg_-^  =  I.)   From  (3.456)  and  the 

fact  that  J-HiB-Hf'  is  a  projection  matrix,  i  =  1 ,2, •  •  • ,s-2,  it  follows 
^i    '■ 

that  U-  is  a  projection  matrix  and  furthermore  range(U£)  is  the 
orthogonal  complement  of  rangef^-^^ — H*B^_^H*'j,  i  =  1 ,2,  •  •  •  ,s-l  fsee 
Christensen  (1987,  p.  339)).   Also,  for  Cq  =  1 , 

rank(Uj)  = c^ -c^_^,  i  =  1 ,2,  •  •  • ,s-l . 

Let  S-  be  a  (c-  -  c- _,  )  x  (Cg_j^  -  1)  matrix  of  rank  c^-Cj_^  whose 
rows  of  orthonormal  and  span  the  row  space  of  U • ,  i  =  1 ,2,  •  •  • ,s-l . 
Then  similar  to  Khuri's  (1990)  Lemma  3.4  (ii),  it  is  easy  to  verify 
that 


flc-c  ,'   i  =  j  =  l,2,---,s-l 
o  c/.  _J   1    1-1 

^i^j-lo,       i^j. 


Now,  define  the  (Cg_-|^  -  1)  x  (Cgj  -  1)  matrix  S  as 

S=[Si:S^:...:S'3_i]'. 
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Then  S  is  orthogonal  and  diagonalizes  H*BjHj', •  •  • ,H*Bg_2Hi' 
simultaneously. 

In  the  following  subsection  we  consider  the  transformation 
X  =  Su*. 

The  transformation  x=:Su*.   Define  the  (Cg_j-l)xl  vector  x  as 


x  =  Su' 


where  S  is  the  orthogonal  matrix  mentioned  above  and  u   was  defined 
in  (3.445).   From  Lemma  3.15  we  have 

(3.468) 


Var(x)='EAi<T|+<T2_iI^   ^^+ 6Sllti^'^_-^k^_-^)-'^V'^ 
i=l  s-1 


where  A-  =SH*B-H*'S'  is  a  diagonal  matrix  consisting  of  the 
eigenvalues  of  HjB-H*'.   The  matrix  A-  is  of  order  (Cg_-|^-1) 

>^(S-1-1)' 

i=l,2,--',s-2.   Lemma  3. 17  provides  the  form  for  A-,  i  =  l,2,---,s-2, 

Lemma  3.17 

The  eigenvalues  of  H*B.H*'  are  0  with  multiplicity  c^^^-c^  and 
c-  with  multiplicity  c-— 1,  i  =:  1 ,2,  •  •  •  ,s-2,  fc-  is  given  in  (3.446)1. 
Furthermore,  we  can  express  A-  as 
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'Ic=-1  0> 


^i=^i   o'    0  '  i  =  l>2,.  ••,s-2.  (3.459) 


Proof : 


Since  HjB^Hj  is  of  order  (Cg_-j^  -  1)  x  (Cg_-|^  -  1)  and  has  rank  c.-l 
(see  (3.457)j,  0  is  an  eigenvalue  with  multiplicity  c   -.  —  1  —  (c-  —  1)  = 
Cgj— c^,  i  =  1 ,2, ■  •  •  ,8-2.   Also,  the  nonzero  eigenvalues  of  H*B-Hj' 
are  equivalent  to  the  nonzero  eigenvalues  of  BH*'H*,  i  =  1 ,2,  •  •  ■  ,s-2. 
However,  for  i  =  1 ,2,  •  •  •  ,s-2 

BiHrHi*=(Ic.®Jc.)(Icg_j-Jcg_j)  (see  (3.444)  and  (3.446)). 

Recalling  that  c^c-  =  Cg_-^  for  i  =  1 ,2,  •  •  •  ,s-2  (see  (3.446)  and  (3.389)], 
we  have 


^s 


rnVi^^^-^c.^j..]. 


Thus,  for  i  =  1 ,2,  •  •  •  ,s-2, 


iHrHr=(Ic.®Jc.)-c^(Ic.®JcJ(Jc.®Jc.) 


1   ^i   ^s-1   '^i    '^i 


=  (Ic^<S)JcJ-(Jc^®  Jc.)   (recall,  c^c^^^c^-^) 


=  (Ic.  -Jc.)®Jc.- 
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Now,  the  nonzero  eigenvalues  of  (I^  —  J^-  )  ®  J?^  are  the  products  of 

^i    ^i     ^i 

the  nonzero  eigenvalues  of  (I_  -  J^  )  with  those  of  J-=    .      Thus,  since 
Iq.  —Jq-     ^®  idempotent  with  rank  c-  —  1  and  the  only  nonzero  eigenvalue 
of  J^.  is  c^ ,  the  nonzero  eigenvalues  of  B-H*'H*  are  c-  with 
multiplicity  c^-1,  i  =  1 ,2,  •  •  •  ,s-2. 

Since  the  diagonal  elements  of  A-  are  the  eigenvalues  of  H*B-H*' 
and  since  (3.456)  implies  that 

range(Aj)  C  range(A2)  C  •  •  •  C  range(Ag_2)  which  in  turn  implies  that 
rankCAi  +  Aj-f-  •  •  •  +  A  •)  =  rank(A  .)  =  c  ■ -1,  j  =  1 ,2,  •  •  •  ,  s-2 ,  we  can, 
without  loss  of  generality  express  A-  as 

^=^i(^  0     oj'  i  =  l>2,---,s-2.  □ 

In  (3.458)  the  variance-covariance  matrix  of  x  is  not  diagonal. 
To  diagonal  this  matrix  we  make  a  transformation  utilizing 
resampling.   This  is  done  in  the  next  subsection. 

Utilizing  resampling.   Consider  once  again  SSE^^  defined  in 
(3.346).   Let  R   be  the  (m-l)x(m-l)  symmetric,  idempotent  matrix 

^*-Vl-Qs-l-  (3-460) 

(Qj,  j  =  l,2,  •  •  •  ,s-1,  is  defined  in  (3.433)).   Then 


SSE^^  =  wR*w. 


Clearly  R   has  rank  "i  — Cgi.   Now,  we  can  express  R*  as 
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R*  =  C*A*C*'  (3.461) 


whe 


C*  =  [Cr:C2*:C3*] 


A*  =  diag(I  *,I  ^,0) 


'?r=^s-i-i 


r]2=m-2c^_^  +  l 


(3.462) 


(3.463) 


and  Cj  ,  Cj  ,  and  C3  are  of  orders  mx?;*,  m  x  7?| ,  and  m  x  c  _^  , 
respectively.   Ve  note  that  rj*  +  i]*  =  m- c^_-^  =  ra.nk(K*) .      Also,  rj*  >  0   by 
(3.393). 

In  order  to  diagonal  the  variance-covariance  matrix  of  x  (see 
(3.458)j  we  introduce  the  following  transformati 


:ion 


1 
r  =  >?+('^maxl  *-L*)2C*'y,  (3.464) 


Vl 


where  L*  =  SH*(A'g_jAg_-,^)  ^H*'S'  and  A*a.x  is  its  largest  eigenvalue.   The 

vector  X  was  defined  in  (3.457)  and  C*  in  (3.462).   The  random  vector 

1 
y  was  given  in  (3.429).   The  matrix  (A*^^!  +  -  L*)2  is  well-defined 

since  Aj^g^^I  *-L   is  p.s.d.   It's  eigenvalues  are  the  square  roots  of 

the  eigenvalues  of  Aj^g^^I  +  -  L*  . 


-245- 


Theorem  3.14. 


The  random  vector  r  is  normally  distributed  with 


(a)   E(r)=g^     1 
^s-1   ^ 


(3.465) 


(b)   Var(r)  =  E^Aicr|+6*I^    _^ 
i=l  s-1 

where  "J*  =  <Tg_j  +  A*a,x'^  ^nd  8   was  defined  in  Lemma  3.13(b). 
Proof : 

The  proof  of  Theorem  3.14  follows  similarly  to  that  of  Lemma 
3.13  and  is  therefore  omitted.  Q 

The  exact  tests.   Let  us  partition  r  as 

l={l'i-l2--  ■  ■••Z's-l)'  (3.466) 


where  r^  is  of  order  (c-  -  c- _-^)  x  1 ,  i  =  1 ,2,  •  •  •  ,s-l  (cq  =  1).   Note  that 
s-1 

zJ  (cj^  -  c^_j)  =  Cgj  -  I.   Then  the  following  facts  are  apparent: 
i  =  l 


(i)    ri~N(0,  (|:.<r2jCj+^*)I^._^._J,  i  =  l,2,...,s-2 

(3.467) 

I„  1  ~N(0,  f*Ic    _c    ). 
s-1    v-»    Cg_j   Cg_2>' 
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This  follows  since  we  can  use  Lemma  3.17  to  express  Var(r)  in  Theorem 
3.14(b)  as 

Var(r)=diag((^EW+.*)I^^_,,  (  g^j  +  ^*)Ic,  -  c, ,  "  "  "  , 


(ii)   Ii '12' ■  ■  ■ 'Is-l  ^^^   mutually  independent. 


/  s-2  „      N 
(iii)  I-Ii/f  E'^'iC.  +  6*j~Xc.-c.  -,.  i  =  1,2,.  • -,3-2 

(3.468) 


:'s-irs-i/^*~x^c,_^ 


Thus,  if  we  wish  to  test  E^-.a^-O   vs.  Ha,:<7-?7^0,  i  =  1 ,2,  •  •  •  ,  s-2,  an 
appropriate  test  statistic  would  be 


Fi=  /  '  ' jj '-^   ,,   i  =  l,2,.-.,s-2.      (3.469) 

'  ^i+i^i+i/C^i+i-S) 


Under  Hq,    ^i  ~  f'c.  -  c- _-^  ,c.^^  -  c- .    i  =  1 ,2,  •  •  •  ,s-2.      Under  H^,, 
Fi~(l+CiCi)Fc._e._^,Ci^l-c.    -here 


<T? 


<i=    s-2       ' '  (3.470) 

j=i  +  l    -^    ^ 


'2-     +5* 


i  =  1,2, •  •  •  ,s-2.   Since  c^  > 0  for  all  i  =  1,2,  •  •  • ,s-2,  we  would  reject 
Hq  for  large  values  of  our  test  statistic,  F-  ( i  =  1 ,2, •  •  •  ,  s-2) . 


-247- 


3.4.6.   Reduction  of  the  Exact  Tests  when  the  Design  is  Balanced 

Introduction.   In  this  section  we  show  that  the  exact  tests 
reduce  to  the  usual  ANOVA  tests  when  the  design  is  balanced.   For 
testing  EQ-.cTg-O   vs.  E^^-.a^^O   the  test  statistic  Fg ,  in  (3.407), 
clearly  reduces  to  the  usual  ANOVA  test  since  it  is  based  on  the  same 
mean  squares  as  are  used  in  a  balanced  data  situation.   In  the 
remainder  of  this  section  then,  we  concentrate  on  showing  that  the 
exact  tests  developed  for  testing  HqIct?  =0  vs.  E^^-.a^-   ^0, 
i  = 1 ,2, •  •  •  ,s-l ,  reduce  to  their  ANOVA  counterparts  when  the  design  is 
balanced. 

Reduction  of  the  exact  test  for  Hq:<t^_-[^  =  0.   The  test  statistic 
derived  for  testing  Hq:<t|_^=0  vs.  E^:(t'^^_^  :^  0   was  defined  in  (3.441). 
It  was  based  on  the  random  vector  w  defined  in  (3.426).   Now,  when 
the  design  is  balanced,  that  is,  when 


ma    =  a_ 
^s-1    ^ 


and 


(3.471) 


"^^s^^s+l' 


the  matrix  K  fsee    (3.418))  becomes 


K  —  5 — If.    . 

^s   ^s 
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Hence  L  (see  (4.427)1  reduces  to 


ag  1  1   ag  Cg-l 


(- 


this  case  Cg  =  m  =  n  a-  (see  (3.389)]). 


Thus  A^g^j^,  the  largest  eigenvalue  of  L,  is  ^.   C 
(3.426))  simplifies  to 


^.   Consequently  w  (in 


w=:u  =  Hiy       (see  (3.420)). 


From  (3.441)  we  can  express  the  test  statistic  as 


F    _^'(Qs-l-qs-2)'^/(S-l-S-2) 
s-1         SSEy/(m-C3_i) 


_rHUqs-i-qs-2)Hiy/(s-i-s-2)  /  ,,  ,,  ,,,,\ 

-  -/U//T n MT  -  / / ^ —  (see  (3.426)  . 

y"i(im-i-tis-i)Hiy/('"-^s-i)    ^     ^ 

(3.472) 


Now,  from  (3.433)  and  (3.413),  when  the  design  is  balanced  we  have 
for  j  =  1,2,  •  •  • ,s-l, 

qj  =  H,Aj(A'jAj)-VjHi 


=  FVHi[Ic.®Jc.a   ]H'r 


(recall    (3.446)   and    (3.471)) 
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Also,  from  (3.419), 


HjHj  —  I^  — J^  =  Ij,  —  Jc     (*^s  =  '''  when  the  design  is  balanced) 
s    s 


=  (Ic.®Ic.aJ-^(Jc.®Jc>s) 
J     J  ^    ^    J     J 


Consequently  for  j  =  1,2, •  •  •  ,s-l. 


HiQ jHj  = 


(Ic.®Ic.aJ-(J 
J     J  =" 


C.  Cjag^ 


Ic  .®  Jc -a 


(Icj®Icja3)-(Jcj®Jcja3) 


(Ic.®Jc.as)-(Jc.®Jc.as) 


(Icj®Icja3)-(-Jcj®Jcja3) 


=  ((!   -J   )®J^   )     ((Ic.-Jc.)Jc.=0)-       (3.473) 

Thus,  when  the  design  is  balanced,  Hi(qg_ ^  -  qg_2)Hi  in  the  numerator 
of  (3.472)  becomes 


[('S-1  -\-l)  ®\-ias]-[(Ic3.2- JC3_2)  ^  J^ 


s-2^s 


[«^ 


'   ^®Ia     )-(Jr 
'S-2    *S-1^   ^  '^s-2 


))®J7^ 


L  ^s-2   *^s-2'^    <=s-2^s 
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(recall  that  Cs-2~^s-l  ^""^  ^s-1  — ■'^)"   Hence  the  numerator  of  F   -, 
reduces  to,  finally 

y'[lc3_2®(Ia^_l-Ja^_l)®Ja,^/(Cs-l-Cs_2)-      (3-474) 
Similarly,  we  can  show  that  Hj(Ijj|_ j  -  Qg_^)H]^  reduces  to 

Hi(  Vl  -  «s-l)Hi  =  Ic^_i  ®  (lag  -Jag)  (3-475) 

when  the  design  is  balanced  (recall  that  Cg = m) . 

In  a  balanced  data  situation,  y  (see  (3.411)  and  (3.412)]  becomes 

^  =  ^('-s®iag,i)y.  (3.476) 

Therefore  the  numerator  and  denominator  of  (3.472)  become, 
respectively,  aside  from  a  constant, 

4y'[lcg_2®(V,-Ja3_l)®^a,®Ja3,l]y         (3-477) 
44^C3_i®(Iag-Jag)®Ja3^l]y-  (3-478) 
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Hence  agw'(Qg_j  -  Qg_2)w  and  agSSE^  reduce  to  the  usual  ANOVA  sums  of 
squares  from  a  balanced  model  (see  Khuri  (1982)).   Thus  F  _-■  (in 
(3.472)j  reduces  to  the  usual  ANOVA  F-test  associated  with  the 
hypothesis  Rq-.ct^    -,=0. 

Reduction  of  the  exact  tests  for  UqKt^-  =0,    i  =  1 ,2,  •  •  •  ,s-2.   When 
testing  HQ:cr^=0,  j  =  1 ,2,  •  •  •  ,s-2,  we  concerned  ourselves  with  the 
vector  T   defined  in  (3.464).   When  the  design  is  balanced,  the  matrix 
L   (given  in  the  expression  for  r)  reduces  to 

L*  =  SH*(A'g_lA3_-^)-iH*'S' 

=  SHf(^Ic^)H*'S'  (see  (3.413)  and  (3.471)) 


^s 


(see  (3.444)) 


a— 1_    _ I .       (S  is  orthogonal) 


^s  S_i-1 


Consequently,  its  largest  eigenvalue,  X^^^,    is  ^.   Therefore,  wh 
the  design  is  balanced,  r  becomes 


en 


T  =  X  —  Su'* 


(see  (3.457)) 


=  SHfw        (see  (3.445)) 
=  SH*D*y      (see  (3.446)). 


Hence, 
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t't  =  y'D*'H*'S'SH*D*y 


=  c.'D*'(Ic^_^-J^^_^)D*y      (see  (3.444)) 


w'D*'D*w  -  u'd*1^        D*w 


's-1 


^M^c^_^®Ja.^)K 


^HiCJc   -,®JaJH'iV   (see  (3.442)  and  (3.413) 


i-'Hi 


(If.    -J^   )®J, 


^JHiy. 


(3.479) 


As  mentioned  previously,  when  the  design  is  balanced,  w  reduces  to 


Hiy  =  a^— H,(Ir  (g)li    )y. 


Consequently,  after  some  simplification,  we  can  express  (3.479)  as 


^sa-s-^l- 


,  *^s-l   '^s-l^   ^s   ^•s+l/ 


(3.480) 


Now,  recalling  (3.398),  when  the  design  is  balanced  we  have,  for 
j  =  1,2,  •  •  •  ,s. 


Pj  =  ^j(^?j)"^j 
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=  Ic.®Jn.       (see  (3.384)  and  (3.471)), 


J    J 


where 


s+1 
n  .  =  n/c  .  =  .  n  a.  ,   j  =  1,2, 
J      J   1=1+1  ^ 


s+1 
n  =   Ha-, 
i  =  l  ^ 


Therefore  (3.480)  can  be  written  as  (letting  Pq  =  Jjj) 


--   ^s^s+l-L  ^s-1   ^s   ^g^j^   ^  Cg_-^   as'<^  ^s+l'^J^ 


However, 


"^i^^'^^-l-^0]y-  (3-481) 


s-1 


.1=1   -^   -^ 


therefore 


^'^  =  ^i4^K'|/^J-^M* 


_    ,   s_^l 


^^^.5/(pj-^j-i)^ 


From  Khuri  (1982),  the  sums  of  squares  y'(P  •  -  P -.i )y,  j  =  1 ,2,  •  •  •  ,s-l , 
are  the  usual  ANOVA  sums  of  squares  from  a  balanced  model  associated 


-254- 


with  the  effects  0,,---,p      -i    (note  that  our  "P  •  —  P  •  h"  is  Khuri's 

"p.").   Thus  a„a„.i  t'-t  ■    reduces  to  the  usual  ANOVA  sum  of  squares, 
J  ^  s  s-(-i  -j-j 

j  =  1 ,2,  •  •  •  , s-1 .   Consequently,  the  test  statistic  developed  to  test 
Hq:(t|=0,  i  =  1,2,  •  •  •  ,s-2,  (see   (3.469))  reduces  to  the  ANOVA  F-test 
when  the  design  is  balanced. 

Expressions  for  the  power  of  the  exact  tests  are  developed  in 
the  next  section. 

3.4.7.   The  Power  of  the  Exact  Tests 

Introduction .   Ve  now  investigate  the  power  of  the  exact  tests. 
Ve  first  concentrate  on  developing  an  expression  for  the  power  of  the 
test  associated  with  a-,    i  =  1 ,2, ■  •  • , s-2 .   Since  the  imbalance  affects 
the  last  two  stages  only,  our  expression  for  this  power  function  will 
be  exact. 

The  approximate  power  of  the  test  associated  with  (^\_-i    is 
derived  using  Hirotsu's  (1979)  procedure  (see  Appendix  D) .   A  similar 
expression  can  also  be  developed  for  the  power  of  the  test  associated 
with  CTg. 

The  power  of  the  exact  test  associated  with  HqIct?  =0, 
i  =  1 ,2,  •  •  •  ,s-2.   In  testing  HQ:or?  =0  vs.  U^ia'^-^O,    the  appropriate 
test  statistic  was  given  in  (3.469).   If  "^ ■    denotes  the  power 
associated  with  this  test,  then,  for  i  = 1 ,2, ■ • • ,s-2, 

*i=P(Fi>Fa,c.-Ci_i,Ci^l-cJH^)  (3.482) 
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Under  H^,  F^  ~ (1 + c-C: )Pr.  -  r     r         -r      where  c-  was  defined  in 
(3.446)  and  Cj  in  (3.470),  i  =  1 ,2, •  •  • ,s-2.    Consequently,  (3.482) 
becomes 

"*;  =P(Fc.  -c-  .  c  .  -c-  >  1  _L  -  ^  Fq  c.  -c     c    -r  )»        (3.482) 

(i  =  1 ,2,  •  •  •  ,s-2)  .   Since  ^-  is  a  monotone  increasing  function  of  C^- , 
it  is  a  monotone  decreasing  function  of  6*  (everything  else  being 
fixed)  where  5*  was  defined  in  Theorem  3.14(b).   (Recall  that 


cj\ 


'>i  —  s-1  '   ^   —  lj-^5'''>S-Z, 

j=i+l  ^    ^ 


(see  (3.470)).)  From  the  definition  of  b*   in  Theorem  3.14(b)  we  have 


'^*  ='^s-l+'^max<^ 


=  <^s-l+'^max(4  +  ^max<^?)     (see  Lemma  3.13(b)J. 

It  is  thus  desirable  for  A*^^^^  and  A*a^xAj„a^3^  to  be  small.   A  sufficient 
condition  for  this  to  occur  is  when  both  A*^^,^  and  A^^^,^  are  small.   The 
next  lemma  provides  bounds  for  A*a,x  and  A^g^^. 
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Lemraa  3.18 


•^max  ^'^^   ^max  satisfy  the  double  inequalities: 


s-1  a  p   1         „U 

''s^l      s-1         m^ 


(b)  m    En7-<^max<-rTT'  ^^^^^ 
0s    ^s         n*^^^ 


(0 
n^  '  =  min  Un 

0s       ^s 


(i) 
1^  '  =  mm 


^s-1   ''«-! 


Proof : 


(a)  ^^Q,x    is  the  largest  eigenvalue  of  the  (c   i  —  1)  x  (c   -.  —  1)  matrix 
L*=SH*(A'g_jAg_j)-^H*'S'  (see  (3.464)).   Thus 

^max  =  emax(sHr(A'3_iA3_i)-iH*'S') 

=  emax(Ht(A's_iAg_i )-%*')       (S  is  orthogonal) 

<  emax[(A's_iA3_i)-%ax(Hr'Hn 


~  ^maJ  (As_i  A   1  )  (H*  has  orthonormal  rows) 


emax(  ®  '"^'  .)       (see  (3.413)) 
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1     u  (i) 

-r^,    where  m^  ^ 

,(0' 


min  m£ 


's-1 


It  is  also  true  that 


A*   > 


-l3^tr[SHr(A;_iA^.l)-%*'S'], 


the  term  on  the  right  being  the  average  of  the  eigenvalues  of  L'' 
Consequently, 


A*   > 


7rTtr[(A'3_iA3_i)-iH*'H*] 


:pn4(^s-l^s-l)-'-(A's-lAs-l)-^Jc3_J 

fsee  (3.444)) 

^f'{(*'s-lAs-l)-']-^!'c3_i(A'3.,A._i)lc3.J 


'8-1 


X  "^^  .  c: 


E~-^ 


^s-i  "^-^     ^-'Vi  "^-1 


(see  (3.413)) 


's-1  ^^   ^s-1 
s-1 


Hence,  the  result  follows.   The  proof  of  (b)  is  similar. 


D 


It  is  more  important  that  A*  „  be  small  than  for  A„„y  to  be 


max 


small.   Even  if  -^max  ~  ■'■  (it's  maximum  possible  value  regardless  of 
(nested)  design),  the  product  A*ax'^max  ^iH  be  small  if  A*ax  is  small. 
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The  power  of  the  exact  test  associated  with  lln'.a^    .,   =0.   We  now 
obtain  an  expression  for  the  power  of  the  test  for  Hq:  cr^-.  =0.   In 
this  case  the  appropriate  test  statistic  is  (see  (3.441)1 


p    _^^(qs-i-qs-2)^/(s-i-^s-2) 

s-l         SSEy/(m-Cg_i) 


where  Q-  was  defined  in  (3.433),  j = 0, 1 , •  •  •  ,s-l ,  and  the  denominator 
in  (3.436).   The  random  vector  w  was  given  in  (3.429).   Under  R^, 
w'(qg_j — Qg_2)y  is  distributed  like  a  random  variable  of  the  form  [see 
Box  ( 1954a) j 


Cg_2~Cg_2 

T=     E    'T-x-,  (3.484) 

j  =  l     -^  ^ 


where  the  x's  are  independent  chi-squared  variates  with  one  degree  of 
freedom  and  the  tt's  are  the  nonzero  eigenvalues  of  the  matrix 

^  =  (Qs-l-Qs-2)(V^^(y))  (3.485) 

Similarly  to  the  latter  part  of  the  proof  to  Theorem  3.13(a),  we  can 
simplify  (3.485)  to 


-^  -  (Qs-l  -  «s-2)Ps-lMs-lH'i  +^I„_i]. 


The  exact  distribution  of  T  is  complicated.   To  obtain  an 
approximate  expression  for  the  power  we  use  the  approach  described  in 
Hirotsu  (1979)  (see  Appendix  D) .   Let 
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MSEy  =  SSE^/(m-Cg_^) 


(3.486) 


USEJS 


(3.478) 


h— J— F^  _    __    m  —  <~       (3.488) 


where  in  this  case 


tr(Jt2) 
tr(Jt) 


(3.489) 


^^[tr(^)f 
tr(  Jt^)  ' 


(3.490) 


Then  the  approximate  power  is  given  by 


*s-l  =  P("  >  MHa)  =  P(Ff  ^f^  >  h)  +[^/{3(f+2)(f+4)B(if  ,lf2)}] 


'rf+2Vf+4^      2(f+f2)(f+4)      (f+V2)(f+f,)- 
^*^^^^^+^^         UV(fh)      +    {1+V(fh)}2 


(3.491) 


where 


f2  =  m-Cg_^ 


(3.492) 
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[tr(.^)][tr(Jt,^)] 
^  = r^-  /  ii2xn2 1  (3.493) 

and  B(a,b)  is  the  beta  function. 

Similar  to  the  derivation  of  E[w'(Q   h  — Q   2)y/(c„_-i  —  c   q)] 
given  after  (3.441),  we  can  find  the  trace  of  J^>   as  follows: 

tr(Jt)  =  tr[(Q3_i-Q^_2)(a2_lHiA,_lHi  +  n^_l)] 

=  Ks_i-c3_2)+4_itr[(Q^_l-q^_2)(HiA3_iH;)] 

=  ^(S-1  -  S-2) +'^s-lt^[A',-l"i(Qs-l  -  «s-2)Ms-l] 

=  ^(^s-l-^s-2)+'^s-l  fc  d,j(Qs-l-Qs-2K-l,j 

(3.492) 

where,  recall 


^s-l,j=Hi^s-l,j'   J  =  l,2,---,c-l 


and 

In  order  to  evaluate  (3.491)  we  also  need  tr(Jt'^)  and  tr(Jt^).   These 
quantities  can  be  expressed  in  much  the  same  way  as  tr(Jt)  in  (3.494) 
This  method  of  evaluating  these  traces  is  similar  to  the  idea  of 
synthesis  as  described  in  Milliken  and  Johnson  (1984,  p.  219-231). 
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3.4.8.  A  Numerical  Example 

Data  were  generated  according  to  the  following  4-fold  random 
nested  model: 

yijk«t  =  ^  +  '^i+^ij  +  Tijk  +  *ijk£  +  ^ijka' 

i  =  l,2,3;  j  =  l,2,3;  k  =  l,2,3;  £  =  1 ,2,  •  •  •  ,m.  j^;  t  =  1 ,2,  •  •  •  ,n.  jj^^; 
where  /i  is  an  unknown  constant  parameter;  and  a-  ,    /?•  •,  7-  -i,)  <5-  -up, 
and  f J  ik^t  ^^®  independent  normally  distributed  random  variables  with 
zero  means  and  variances  a^,    (Tq,    at,    ai,    and  a^,    respectively.   The 
™i ik  ^^l"^s  were  randomly  generated  from  the  integers  {2,3,4}  and  the 
°i  ik£  v3-lues  were  randomly  selected  from  {1,2,3,4}.   Furthermore,  we 
took  ;i=10,  (T^  =  3,  cr^  =  0.2,  (tI^-I,    <^|  =  5,  and  a^  =  1 .      These  values 
were  only  used  to  generate  this  (artificial)  data  set. 

Using  the  methodology  developed  in  Subsections  3.4.3—3.4.5,  we 
tested  the  following  hypotheses:   Hq:(t^  =  0,  HQ:cr^  =  0,  Eq-.^'I-O,    and 
Hq:(7^  =  0  versus  their  usual  alternatives.   Table  3.16  lists  some 
important  quantities  used  in  the  development  of  the  exact  tests. 
Table  3.17  summarizes  our  findings. 

3.4.9.  Concluding  Remarks 

In  this  section  we  developed  exact  tests  for  testing  the 
variance  components  in  a  general  unbalanced  s-fold  random  nested 
model  where  the  imbalance  affects  the  last  two  stages  only.   In 
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Table  3.16    Some  useful  quantities  used  in  the  construction  of  the 
exact  tests  for  the  four  variance  components  in  the 
example 


Quantity 


c.  =  m 


'max 


A* 
''max 


Formula  cited 
(3.380) 
(3.381) 
(3.381) 
(3.381) 
(3.381) 
(3.385) 
(3.387) 
(3.389) 
(3.389) 
(3.389) 
(3.389) 
(3.407) 
(3.422) 
(3.428) 
(3.441) 
(3.446) 
(3.446) 
(3.446) 
(3.446) 
(3.464) 
(3.465) 


Value 
4 
{i} 

{i,j,k} 
{i,j,k,e} 
216 
78 
3 
9 
27 
78 
9.6053 
1.0 

^8  +  -^max'^e 
2.5540 

9 

3 

I3®  Jg 


I9®  J3 


0.5 


^^7  +  ^max^ 
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Table  3.16  -  continued 


Quantity  Formula  cited  Value 

Fj  (3.469)  0.7592 

Fj  (3.469)  1.9929 


Subsection  3.4.3  we  derived  an  exact  test  for  testing  HQ:crg=:0.   This 
test  is  the  usual  ANOVA  test.   Next  we  considered  testing  HqCo-^..  =0. 
In  this  case,  we  employed  resampling  and  made  a  transformation  that 
resulted  in  a  model  which  resembled  an  unbalanced  (s-l)-fold  nested 
model.   Using  this  fact,  we  were  able  to  consturct  an  exact  test  by 
deriving  a  test  statistic  that  was  similar  to  the  one  used  in  testing 
the  equivalent  null  hypothesis  in  an  unbalanced  (s-l)-fold  nested 
model  in  which  all  the  effects  are  considered  fixed. 

Finally,  in  Subsection  3.4.5,  we  constructed  exact  tests  for 
testing  <7j ,  (^2' '  '  ' ''^s-2'      ^^^  motivation  behind  our  procedure  came 
from  noting  that  after  making  the  above  transformation,  the  resulting 
model  resembled  an  unbalanced  random  nested  model  with  imbalance 
affecting  the  last  stage  only.   Consequently,  we  could  apply  the 
results  in  Khuri  (1990)  to  aid  in  the  development  of  our  methodology. 

From  Seely  and  El-Bassiouni  (1983),  it  is  easily  seen  that  the 
exact  test  for  testing  Ho:o'|  =  0  is  a  Vald's  test  associated  with  model 
(3.383).   Similarly,  the  exact  test  for  testing  E^-.a^    -.=0   is  a  Wald's 
test  associated  with  model  (3.429).   Recalling  (3.464)  and  (3.466),  let 


:^^^  =  (r[:r!,:...:r\y,         i  =  1 ,2,  •  •  •  ,s-l . 
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Table  3.17    Results  from  our  testing  procedures 


Test  for     Numerator  Denominator 

factor    degrees  of  freedom  degrees  of  freedom 


A 

B(A) 
C(AB) 
D(ABC) 


2 
6 

18 
51 


6 

18 

51 

138 


F-value    p-value 


Fi=1.9929    0.2169 


F2=0.7592    0.6109 


F3=2.5540    0.0045 


F4=9.6053  < 0.0001 


Then,  the  exact  test  for  testing  Hq:(7'-=0,  i  =  l,2,---,s-2,  is  a 
Vald's  test  based  on  the  model  involving  only  Ij ,l2' '  '  ' 'I' +1  (that  is, 
based  on  r^    ^ )  . 

As  a  final  remark,  the  extension  to  more  general  nested  models, 
including  models  in  which  some  of  the  effects  are  fixed,  should  offer 
no  resistance.   Ve  merely  invoke  our  resampling  scheme  (when 
necessary)  and  make  transformations  in  a  "step-down"  manner  similar 
to  the  methodology  presented  in  Section  3.2  and  in  this  section  to 
derive  the  exact  tests. 


CHAPTER  FOUR 
CONCLUSIONS  AND  DIRECTIONS  FOR  FUTURE  RESEARCH 


In  the  teaching  of  linear  models  there  seems  to  be  an 
underlining  hierarchy  that  is  followed.   The  first  linear  model  most 
students  and/or  researchers  are  exposed  to  is  one  in  which  the 
effects  are  fixed  and  the  data  balanced.   For  such  models,  the  idea 
of  an  "analysis  of  variance"  is  used  to  construct  tests  of  hypotheses 
concerning  the  fixed-effects  in  the  model.   These  tests  are  easily 
derived  from  the  corresponding  ANOVA  table.   Furthermore,  they  enjoy 
many  optimal  properties.  The  next  step  up  this  hierarchy  brings  us  to 
unbalanced  fixed-effects  models.   Now  the  picture  is  much  more 
muddled.   For  example,  it  is  quickly  learned  that  there  is  no  unique 
partitioning  of  the  total  suras  of  squares  (as  there  is  in  the 
balanced  data  situation)  and  thus  no  unique  way  to  construct  the 
ANOVA  table.   Hence,  the  sums  of  squares  used  in  a  test  of  a 
particular  hypothesis  must  be  chosen  carefully  by  the  researcher. 
Otherwise,  very  spurious  results  may  be  obtained. 

The  concept  of  a  balanced  random  or  mixed  model  is  usually  not 
introduced  until  after  thoroughly  analyzing  the  fixed-effects  model. 
After  discussing  the  differences  between  these  types  of  models,  the 
focus  typically  shifts  to  the  problem  of  constructing  point 
estimators  for  the  variance  components  in  the  model.   Exact  tests  of 
hypotheses  concerning  the  variance  components  and/or  estimable  linear 
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combinations  of  the  fixed-effects  parameters  rest  upon  finding  mean 
squares  that  can  serve  as  "error"  terms  in  appropriate  F-ratios. 
However,  unlike  the  balanced  fixed-effects  case,  exact  tests  (based 
on  the  ANOVA  mean  squares)  do  not  always  exist.   This  phenomenom  is 
even  more  apparent  when  the  data  are  unbalanced.   Consequently,  for 
unbalanced  random  or  mixed  models,  tests  of  hypotheses  are  usually 
only  developed  in  the  context  of  an  exercise  in  the  application  of 
Satterthwaite ' s  procedure.   Therefore,  we  are  left  with  the  idea  that 
in  the  unbalanced  data  situation  exact  tests  either  do  not  exist  at 
all  or  only  exist  for  very  simple  models.   Although  Satterthwaite' s 
methodology  may  provide  adequate  (approximate)  tests  when  the  design 
is  balanced,  there  is  no  theoretical  justification  for  its  use  when 
the  data  are  unbalanced  since  in  these  situations  the  necessary 
assumptions  will  not,  in  general,  be  satisfied.   The  need  for  exact 
testing  methods  is  therefore  clear.   The  derivations  of  some  such 
methods  are  described  in  this  thesis. 

In  the  development  of  our  procedures,  we  made  the  "usual" 
assumptions  about  the  random-effects.   That  is  we  assumed  they  were 
independent  of  each  other  and  had  normal  distributions  with  zero 
means  and  "a  I"  variance-covariance  structure.   This  led  to  a 
particular  structure  for  the  Var(y) .   But,  are  these  assumptions 
correct?   Some  diagnostic  tools  would  be  of  help  here  and  perhaps  the 
results  of  Hocking  (1988),  Hocking,  Green,  and  Bremer  (1989),  and 
Beckman,  Nachtsheim,  and  Cook  (1987)  could  be  of  use.   Also,  it  would 
be  of  interest  to  develop  exact  tests  under  more  laxed  assumptions. 
Except  for  a  few  special  cases,  this  has  not  been  pursued. 
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The  problem  of  constructing  optimal  tests  in  unbalanced  random 
or  mixed  models  seems  to  be  a  difficult  one.  Besides  the  results  of 
Spj0tvoll  (1967),  Mostafa  (1967),  and  Mathew  and  Sinha  (1988),  little 
is  known  about  this  area.   Even  these  papers  deal  only  with  very 
simple  models.    Therefore  it  is  not  known  if  our  tests  (or  tests 
constructed  similarly  to  ours)  possess  any  optimal  properties  similar 
to  the  ones  mentioned  in  the  above  articles.   A  full  solution  to  this 
problem  may  be  unattainable.   Partial  solutions  may  possibly  be 
obtained  through  intensive  computer  simulation. 

Ofversten  (1989)  also  constructed  exact  tests  for  the  variance 
components  in  unbalanced  random  and  mixed  models.   His  methods 
differed  from  ours  in  the  way  he  initially  attacked  the  problem  -- 
orthogonally  transforming  the  model  matrices  into  layered-triangular 
form  (see  Allen  (1974)1.   However,  this  transformation  did  not  always 
lead  to  an  exact  test.   When  it  did  not,  Ofversten  relied  on  a 
resampling  scheme  similar  to  ours  to  aide  in  the  construction  of  the 
test.    Although  his  procedure  is  adaptable  to  many  unbalanced 
models,  it  is  not  clear  how  to  utilize  it  in  models  not  discussed  in 
his  dissertation.   Still  it  would  be  of  interest  to  compare  his 
results  to  ours. 

A  drawback  to  our  procedures,  and  indeed,  to  all  procedures 
utilizing  a  transformation  based  on  a  resampling  scheme,  is  that  the 
values  of  the  test  statistics  are  not  unique.   This  is  true  since 
only  a  portion  of  the  error  vector  is  utilized  in  these 
transformations.   However,  the  distributional  properties  of  the 
resulting  vector  remain  unchanged  no  matter  what  portion  of  the  error 
vector  is  used  in  the  transformation.  Consequently,  the  power  of  the 
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exact  tests  will  not  depend  on  what  portion  is  used  in  the 
transformation.   The  power  will,  however,  depend  on  certain  design 
parameters  (eigenvalues  of  certain  matrices).   Thus,  it  is  of 
interest  to  know  exactly  how  much  influence  these  design  parameters 
have  on  the  power.   This  has  yet  to  be  resolved. 

In  this  dissertation  we  have  developed  exact  procedures  to 
handle  the  variance  components  and  estimable  linear  functions  of  the 
fixed-effects  in  some  unbalanced  random  and  mixed  models.   As  noted 
above,  there  are  still  many  unresolved  problems  left  to  tackle.   It 
is  our  hope  that  the  techniques  put  forth  in  this  work  and  the  ones 
sure  to  be  developed  elsewhere  will  help  in  the  construction  of 
solutions  to  these  challenging  problems. 


APPENDIX  A 
MATHEMATICAL  RESULTS 


We  present  some  useful  mathematical  facts  in  this  appendix. 
When  necessary,  proofs  will  be  supplied.   Otherwise,  the  relevant 
source  will  be  cited.   We  begin  with  a  result  dealing  with  the  rank 
of  a  product  of  matrices 

Lemma  A.l  (Frobenius  Inequality) 

Let  A,  B,  and  C  be  matrices  of  dimensions  nxp,  pxq  and  qxt, 
respectively.   Then 

rank(ABC)  >  rank(AB)  +rank(BC)  -rank(B) 


Proof : 


First  note  that  if  either  A,  B,  C,  AB,  or  BC  is  a  zero  matrix 
then  the  result  trivially  follows.   So,  suppose  that  none  of  these 
matrices  have  rank  zero.   Then,  there  exists  nonsingular  matrices  P 
and  Q,  both  of  order  qxq,  such  that 


P(AB)'(AB)q 


^AB 

0 

0 

0 

°^B-' 

AB 

0 

0 

0 

0, 

1  — 

q-rg 


(A.l) 
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where  r ■ g = rank(AB)  > 0  and  0 < rg = rank(B)  < q. 


Define  the  qxq  matrix  X  as 


X  =  P" 


^AB 

0 

0 

0 

Ir       r 

AB 

0 

0 

0 

0, 

1  — 

q-rg 


Q-^ 


(A. 2) 


From  (A.l)  and  (A. 2)  we  have 


(AB)'(AB)+X  =  P-^ 


^B 
0   0 


0 
q-rg 


r'. 


(A. 3) 


Write  Q"^  as  Q"^  = 


where  Qj  is  of  order  rgXq  while  Q^   i^  o^ 


order  (q  — rg)xq.   Then  from  (A. 3), 


■^B 
0   0 


0 

q-rg 


=  p- 


0 


(A. 4) 


Also,  write  B  as 


B  = 


(A. 5) 


-271- 


where  Bj  is  of  order  rg  x  q  and  Bj  is  of  order  (p  — rD)xq  (rg<P  since  B 
is  pxq).   The  rg  rows  of  Bj  are  chosen  to  be  linearly  independent. 
Since  Q~^  is  nonsingular  it  follows  that  the  rg  rows  of  Q^  are  also 
linearly  independent.   Thus  there  exists  a  nonsingular  matrix  M  of 
dimension  rn x rg  such  that 


Qi  =  MBj 


(A. 6) 


Now 


But 


rank(BC)  =  rank(BiC)  =  rank(M"^QiC)  =  rankCQiC)  , 


rank(Q^C)  =  rank! 


0 


=  rank! 


Ir     0 
^B 


0   0 


q-rg 


Qi 


=  rank  p-^ 


^B 
0   0 


0 
q-rg 


q-^c 


=  rank((AB)'ABC  +  Xc) 


fsee  (A. 3) 


<  rank((AB)'ABc)+rank(XC) 
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<rank(ABC)  +rank(X) 


rank(ABC)+rank(B)-rank(AB)  (from  (A. 2)) 


Thus, 


rank(ABC)  >  rank(AB)  +rank(BC)  -rank(B) 

D 

Remark  A . 1 . 1 

Suppose  q  =  p  and  let  B  be  the  pxp  identity  matrix.   Then,  by 
this  lemma  we  have 

rank(AC)  >  rank(A) -l-rank(C) -p  (A. 7) 

Furthermore,  if  n  =  p  then  we  get,  as  a  special  case  to  this  lemma, 
Sylvester's  law  of  nullity  (see  Searle  (1982)  p.  198]. 

Remark  A . 1 . 2 

Most  often  we  will  use  Lemma  A.l  in  the  following  context:   A  is 
(p-l)xp  of  rank  p-1;  B=Ip;  C  is  pxt  of  rank  r<p-l.   Further,  we 
will  be  able  to  find  a  vector  x^O    such  that  Cx  ,1^  0  and  ACx  =  0. 
Consequently  it  is  clear  that 
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rank(AC)  <  r-1 

By  the  Frobenius  inequality  we  will  also  have 

rank(AC)  =  rank(A)  -|-  rank(C)  -  p 

=  p-1  -|-r-p  =  r-1 

Thus,  rank(AC)  = r-1 . 

The  next  three  results  deal  with  the  simultaneous 
diagonal ization  of  two  symmetric  matrices.   The  proofs  to  all  three 
of  these  lemmas  can  be  found  in  Searle  (1982,  pp.  312-314).   However, 
we  will  present  a  slightly  different  proof  to  Lemma  A. 4  then  what  is 
found  in  Searle. 

Lemma  A . 2 

For  symmetric  A  and  B  of  the  same  order  there  exists  an 
orthogonal  matrix  P  such  that  P'AP  and  P'BP  are  both  diagonal  if  and 
only  if  AB  =  BA. 

Remark  A . 2 . 1 

Since  A  and  B  are  both  symmetric,  the  condition  AB =  BA  is 
equivalent  to  showing  that  AB  is  symmetric. 
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Lemma  A . 3 

For  symmetric  matrices  A  and  B  of  the  same  order,  with  A  being 
positive  definite  (p.d.),  there  exists  a  nonsingular  matrix  M  (not 
necessarily  orthogonal),  such  that  M'AM  =  I  and  M'BM  is  a  diagonal 
matrix  whose  diagonal  elements  are  solutions  A  to  |B  — AA|  =0. 

Remark  A . 3 . 1 

The  diagonal  elements  of  M'BM  are  called  eigenvalues  of  B  with 
respect  to  A.   Since  A  is  p.d.  it  is  clear  that  these  diagonal 
elements  are  also  the  eigenvalues  of  A~^B  which  are  equivalent  to  the 

eigenvalues  of  BA~  which  are  equivalent  to  the  eigenvalues  of 

1    1 
A  2BA  2. 

Lemma  A. 4 

For  real  symmetric  matrices  A  and  B  of  the  same  order  and  non- 
negative  (n.n.d.)  there  exists  a  nonsingular  matrix  P  such  that  P'AP 
and  P'BP  are  both  diagonal. 


Proof : 


Let  the  order  of  A  and  B  be  n,  with  ranks  a  and  b,  respectively, 
and  without  loss  of  generality  suppose  a<b.   Since  A  is  symmetric 
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and  n.n.d.,  there  exists  a  nonsingular  matrix  T  such  that 


T'AT 


la   0 
0  0 


n-a 


(A. 8) 


Denote  T'BT  by 


T'BT  = 


E   F 
F'   G 


(A. 9) 


where  E  is  axa  and  G  is  (n-a)  x  (n-a).   Now,  let  C  be  any  (n-a)xa 
matrix  which  satisfies 


F  +  C'G  =  0 . 


(A. 10) 


One  such  choice  for  C  is 


C  =  -G"F' 


(A. 11) 


where  G   is  a  generalized  inverse  of  G.   That  is,  G   satisfies 
GG~G  =  G.   Consider  the  nonsingular  matrix  S  defined  as 


la   0 
C   M 


(A. 12) 


where  M  is  an  (n-a)  x  (n-a)  nonsingular  matrix  satisfying 


M'GM  = 


Ig    0 


0   0 


n-a-g 


(A. 13) 
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where  g  =  rank(G).   Now  S'T'ATS  =  T'AT  by  the  construction  of  S.   While, 
by  (A. 10)  and  (A. 12),  S'T'BTS  becomes 


S'T'BTS 


where  H  is  the  axa  matrix 


H 

0 

0 

0 

h 

0 

0 

0 

On-  = 

n-a-g 


(A. 14) 


H  =  E  + C'F'. 


(A. 15) 


Note  that  the  rank  of  H  is  b-g.   Since  C'F'  can  also  be  expressed 

as  —  C'GC  (see  (A.  10)1,  we  have  that  H  is  symmetric.   Thus  there  exists 

an  orthogonal  matrix  Q  such  that 


Q'HQ  =  Da  =  diag(Ai,  •  •  •  ,Aj^_g,  0,  •  •  •  ,0) 


(A. 16) 


where  A^,A2,  •  •  •  5'^i,_g.  are  the  nonzero  eigenvalues  of  H.   Therefore, 
upon  defining  R  as 


R  = 


q 

0 

0 

0 

h 

0 

0 

0 

-'-n  -s 

n-a-g 


(A. 17) 


we  have  from  (A. 14)  and  (A. 16) 
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R'S'T'BTSR : 


0   0 


0 
0 

n-a-g 


(A. 18) 


Also, 


R'S'T'ASTR : 


Q'q 

0 


'n-a 


(A. 19) 


la     0 
0    0„_ 


(A. 20) 


Hence,  upon  letting 


P  =  TSR 


(A. 21) 


we  have  P'AP  and  P'BP  both  diagonal, 


D 


Corollary  A. 4.1 


Suppose  the  conditions  of  lemma  A. 4  hold.   Further,  if 
a  =  rank(A)  and  b  =  rank(B),  suppose  that  rank(A+B)  =  a+b  <  n.   Then 
there  exists  a  nonsingular  matrix  L  such  that 


0 


L'AL  = 


0     0. 


0   0 


0 
0 

n-a-b 


(A. 22) 


and 
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L'BL 


Oa 

0 

0 

0 

lb 

0 

0 

0 

o„  „ 

n-a-b 


(A. 23) 


Proof: 


From  lemma  A. 4,  there  exists  a  nonsingular  matrix  P  such  that 
P'AP  and  P'BP  are  both  diagonal.   Since  the  rank(A+B)  =  a+b  =  rank(A) 
+rank(B),  it  follows  that  the  nonzero  diagonal  elements  of  P'AP 
correspond  to  zero  diagonal  elements  of  P'BP  and  vice-versa.   Clearly 
then,  there  exists  nonsingular  matrices  Mj  and  Mj  for  which  M'P'APM  and 
M'P'BMP  have  the  forms  indicated  in  (A. 22)  and  (A. 23),  respectively, 
where  M^M^Mj.   Upon  defining  L  =  PM,  the  result  follows.  D 

Remark  A . 4 . 1 

Although  not  explicitly  stated  in  the  proof  to  lemma  A. 4  or 
corollary  A. 4.1,  it  should  be  noted  that  H,  defined  in  (A. 15),  is 
n.n.d.   This  follows  from  the  fact  that  C'F'  can  be  written  as 


C'F'  =  -FG~F' 


(A. 24) 


(see  (A.  11)  and  use  the  facts  that  FG  F'  is  invariant  to  choice  of  G 
(corollary  12.2.20.2  of  Graybill  (1983,  p.  416))  and  that  E-FG~F'  is 
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n.n.d.   (Theorem  12.2.21(3)  of  Graybill  (1983,  p.  417))).   Thus,  the 
eigenvalues  of  H  are  nonnegative. 

Ve  present  one  last  result  concerning  the  diagonal ization  of 
symmetric  matrices.   This  will  extend  Lemma  A. 2.   The  reader  is 
referred  to  Graybill  (1983,  p.  408). 

Lemma  A . 5 

Let  Aj,A2,---,A,  be  symmetric  nxn  matrices.   A  necessary  and 
sufficient  condition  that  there  exists  an  orthogonal  matrix  P  such 
that  P'A- P  is  diagonal  for  all  i  =  l,2,---,k  is  A-A-=A-A-  (or  A-A-  is 
symmetric)  for  all  i  and  j. 


APPENDIX  B 
STATISTICAL  RESULTS 


In  this  appendix  we  give  some  statistical  results.   Most  of  the 
following  concepts  are  well  known  and  are  only  presented  for  the  sake 
of  completeness.   Unless  otherwise  stated,  these  can  be  found  in 
Graybill  (1976,  pp.  134-140). 

Lemma  B . 1 

Let  the  nxl  random  vector  y  be  distributed  as  N(/i,E)  where  E  has 
rank  n.   If  BEA  =  0,  the  quadratic  form  y'Ay  is  independent  of  the 
linear  form  By  where  B  is  a  qxn  matrix. 

Lemma  B.2 

Let  the  nxl  random  vector  y  be  distributed  as  N(/i,E)  where  E  has 
rank  n.   If  AEB  =  Oj  then  the  two  quadratic  forms  y'Ay  and  y'By  are 
independent. 

Lemma  B.3 

Let  y  be  an  nxl  random  vector  and  let  E{y)  =  fi,    Var(y)=E.   Then 
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E(y'Ay)  =  tr(AE)+yA/i 


The  next  two  results  deal  with  the  distribution  of  a  quadratic  form. 


Lemma  B . 4 


Let  the  nxl  random  vector  y  be  distributed  as  N(^,E)  where  £  has 
rank  n.   Then  the  quadratic  form  U  =  y'Ay  ~  Xp(-^)  where  p  is  the  rank(A) 
and  A  =  i/i'A/j,  if  and  only  if  one  of  the  following  three  conditions  are 
satisfied:   (1)   AS  is  an  idempotent  matrix  of  rank  p;  (2)  EA  is  an 
idempotent  matrix  of  rank  p;  (3)  E  is  a  generalized  inverse  of  A 
(i.e.,  AEA  =  A). 


Lemma  B . 5 

Let  the  nxl  random  vector  y  be  distributed  as  N(/i,E)  where  E  has 

rank  n.   The  random  variable  y'Ay  has  the  same  distribution  as  the 

n 
random  variable  U,  where  U=  S^^^U^  where  the  d^^  are  the 

i  =  l 
eigenvalues  of  the  matrix  AE,  and  where  U^ jUj,  •  •  •  ,Ujj  are  n  independent 

noncentral  chi-square  random  variables,  each  with  one  degree  of 

freedom. 

Our  last  lemma  applies  some  of  the  above  results  to  a  general 
mixed-model.   This  result  is  particularly  useful  since  it  is  used 
throughout  this  dissertation. 


-282- 


Lemma  B.6 

Consider  the  general  mixed-model 

y  =  Xg  +  Zh  +  £  (B.l) 

where  X  =  (X^:  •  •  •  :X^_p)  ;  Z  =  (X^_p^i :  •  •  •  :X^)  ;  g=  {P'o- ■  ■  ■  •Pu-pY   is  the 
vector  of  fixed  effects;  and  h  =  (^3'^^  _^-|^ :  •  •  •  :^'^,)'  is  the  vector  of 
random  effects.   We  will  assume  that  the  nxl  vector  of  errors,  e,  is 
distributed  as  N(0,(T^Ijj)  independently  of  h~N(0,*).   Consequently, 

y~N(Xg,S)  (B.2) 


where 


E  =  Z-^Z' +  <t|Iii  .  (B.3) 


For  ease  of  notation  let  W=[X:Z]  and  let  r  =  rank(V) .   Also,  define 
the  nxn  matrix  R  to  be 

R=Iii-V(W'V)~T^'  (B.4) 

Then 

(i)   R  is  symmetric,  idempotent  of  rank  n-r. 
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(ii)   Clearly  R¥  =  0.   Since  range(Xj^)  C  range (V)  for  all 

1  =0,1,  •  •  •  ,1/,  it  follows  that  RX^=0,   i  =  0, 1 ,  •  •  •  ,i/. 

(iii)  The  error  sum  of  squares  can  be  written  as 

SSE  =  y'Ry.  (B.5) 

Since  from  (i)  and  (ii),  RE/cr^  =  R  (which  is  idempotent) 
and  RXg  =  0,  we  see,  using  lemma  B.4,  that 

SSE/a|~xLr- 
(iv)   From  (i),  there  exists  an  orthogonal  matrix  C  such  that 

R  =  CAC'  ,  (B.6) 


where 


Ar=diag(I^^,I^^,0)  (B.7) 


and 


rji-fr?2  =  n-r.  (B.8) 


Suppose  7;j>0,  then  for  r]2   to  be  positive  we  must 
necessarily  assume 
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n>r?i-t-r.  (B.9) 


Partition  C  as 


C=[Ci:C2:C3]  (B.IO) 


where   Cj    is    n  x  rjj ;    C2    is    n  x  772  i    ^^nd   C3    is    nxr.      Then 


CjC.  =1  (i  =  1,2,3) 


C'.Cj  =  0  i#j, 


(B.ll) 


Furthermore 

SSE  =  y'Ry  =  y'CAC'y 

=  y'CiCiy  +  y'C2C^y 

=  SSEi  +  SSE2  (B.12) 

and  SSEj/o-^  ~  Xn     independently  of  SSEj/cr^  ~  x     . 

This  forms  the  basis  behind  the  idea  used  in 
resampling  from  the  error  vector. 

(v)    Suppose  range(X- )  C  range(X^)  for  all  i  =  0, 1 ,  •  •  •  ,  J'-l . 
Define  the  matrix  D  as 
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D=(XX)-X-  (B-13) 


Further,  let  the  random  vector  x  be 


x  =  Dy.  (B.14) 


Then,  by  lemma  B.l,  since 


DER  =  (T^DR  =  0  (B.15) 


we  have  that  x  =  Dy  and  SSE  =  y'Ry  are  independent,  y 
fact  that  DR  =  0  follows  after  noting  that  range(Xj^) 
Crange(V)  and  RW==0.) 


Th 


e 


APPENDIX  C 
AN  ILLUSTRATION  OF  THE  UTILITY  OF  RESAMPLING 


In  this  appendix  we  consider  the  general  unbalanced  mixed  cross- 
classification  model  to  illustrate  the  idea  of  resampling  from  the 
error  vector.   Ve  assume  the  imbalance  affects  the  last  stage  only. 
Thus,  this  model  is  a  special  case  of  model  (3.189)  in  Section  three 
of  Chapter  three. 

As  it  applies  to  models  having  imbalance  in  the  last  stage  only, 
resampling  serves  to  reduce  the  analysis  of  the  unbalanced  model  to 
that  of  a  balanced  one.   For  designs  having  more  general  forms  of 
imbalance,  resampling  will  typically  need  to  be  employed  more  than 
once  to  construct  exact  tests.   This  is  the  case  for  the  designs 
studied  in  Sections  two  and  four  of  Chapter  three. 

Ve  start  by  considering  the  following  model: 

y  =  Xh  +  Zg  +  e  (C.l) 

where  X=(Xo:..-:X^_p);  Z  =  (X^_p^-^ :  ■  ■  •  :X^)  ;  h  =  (^[):  •  ■  •  :/?;,_p)'  is  the 
vector  of  unknown  parameters;  g  =(/?(,_„,  i  :•••  :/?^)'  is  the  vector  of 
random  effects,  y  is  the  nxl  vector  of  observations;  and  e  is  the 
nxl  vector  of  errors.   We  assume  that  /?j/_r)+l''  '  '  yPi/  ^^^  i   ^^^ 
independent,  normally  distributed  random  vectors  with  zero  mean 
vectors  and  variance-covariance  matrices  given  by 


-286- 


-287- 


Var(^.)  =(t|Ic.     (i  ^  u-p+1 ,  ■  ■  ■  ,iy) 


VarU)=<^?In 


where  c-  is  the  number  of  elements  in  /J^  (i  =  0, 1 ,  •  •  •  jZ^) 
The  vector  of  cell  means,  y,  can  be  expressed  as 


(C.2) 


y-Dy 


(C.3) 


where 


D=(XX)"'X^ 


(C.4) 


we  can  also  express  (C.3)  as 


y  =  Hh  +  Gg+I 


(C.5) 


where  H=  (Ag:  •  •  •  lA^^.p)  and  G  =  (Aj^.p^^:  •  ■  •  :Aj,=  Ic^)-   Also,  e  =  De  . 
The  A-  (i  =  0,1,  •  •  •  ,J/-1)  are  described  in  Khuri  (1990).   In  this  same 
article,  Khuri  mentions  that  the  matrices  B-  =  A- A'-  (i  =  0,1 ,  •  •  •  ,i^) 
commute  in  pairs.   Consequently,  by  lemma  A. 5,  there  exists  an 
orthogonal  matrix  Q  which  simultaneously  diagonalizes  them.   Write  Q 
as 


q  = 


(C.6) 
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where  Qj  is  constructed  so  that 

Q2H  =  0.  (C.7) 

From  (C.5)  and  the  distributional  properties  of  g  and  e,    it  can  be 
easily  shown  that 

E(y)=Hh 


Var(y)=   £  a\A  .A\  +  al(X'^\^r^ 
i=j/-p+l 

i=i/-p+l 


where 


Now,    define   the   qxl    random  vector   u   as 


where   q    is    the   number   of   rows   of   Q,.      Then 


E(u)  =Q2Hh  =  0 


Var(u)=       E       <T\Q^B^Q'^  +  all^-\-\i^my,. 
i=i'-p-i-l 


(C.8) 


K  =  (x;,X^)-i.  (C.9) 


u-QiV  (C-10) 


(C.ll) 
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If  not  for  the  Q2KQ20-e  term  in  the  expression  for  Var(u)  in  (C.ll),  we 
could  analyze  model  (C.IO)  as  a  balanced  model.   This  is  where  the 
utility  of  resampling  appears.   Coupled  with  the  results  of  Lemma 
B.6,  resampling  will  allow  us  to  reduce  the  analysis  of  model  (CIO) 
to  that  of  a  balanced  model. 
From  (C.l),  write 

W=[X:Z]  (C.12) 


and 


R=Ijj-W(W'V)-V'  (C.13) 


Assuminf 


n>q  +  c^  (C.14) 

and  using  lemma  B.6(iv),  we  can  write  R  as 

R  =  CAC'  (C.15) 

where  C  is  orthogonal  and  A  is  diagonal.   Furthermore,  we  can 
partition  C  and  A  as 
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C  —  [Ci:C2:C3] 

(C.16) 

A  =  diag(lq,lii_c^_q,0) 

where   Cj    is    nxq;    Cj    is    nx(n-c^-q);    and   C3    is   nxc^.       Define    the   qxl 
random   vector   w  as 


y  =  u  +  (AmaxIq-L)^y  (^-17) 


where  L  =  QjKQ^  and  X^g^^   is  its  largest  eigenvalue.   C^  is  defined  in 

(C.16).   Notice  that  A^^^^Iq-L  is  positive  semidef inite  and  hence 

1 
('^maxIq~L)2  is  well-defined.   From  lemma  B.6  the  following  facts  are 

apparent: 

(i)        u    is    independent  of   C^y. 

(C.18) 

(ii)      CiECi  =  cr|lq. 

Consequently,  we  obtain  from  (C.ll): 

E(w)  =0 

(C.19) 

Var(y)=:   E   (tIQ^B- Q^  +  5Iq 
i=i/-p+l 
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where  6  -  crl  + ajX^g^^.      Thus,  resampling  has  reduced  the  analysis  of  the 
unbalanced  model  (CIO)  to  that  of  a  balanced  one  (C.17)  .   ^See,  for 
example,  Searle  (1971),  Graybill  (1976),  or  Khuri  (1982)  for  a 
discussion  on  the  analysis  of  balanced  models.) 

In  (C.17)  note  that  C^y  is  a  portion  of  the  error  vector,  Ry. 
The  actual  value  of  C^y  depends  on  which  q  columns  are  chosen  from  the 
first  n-Cj,  columns  of  C  to  form  C^  fsee  (C.16)j.   However,  the 
distribution  of  u)   will  not  depend  on  the  choice  of  C^ .   Thus,  this 
approach  is  similar  in  spirit  to  methodologies  employing  resampling, 
such  as  the  jackknife  and  bootstrap. 


APPENDIX  D 
HIROTSU'S  APPROXIMATION 


In  this  Appendix  we  describe  the  F  approximation  (used  in  some 
of  our  power  calculations)  as  developed  in  Hirotsu  (1979).   We  use 
his  notation. 

Let 

x  =  N(/i,E) 


and  let  Xf  be  a  x^   variable  distributed  independently  of  x.   Further, 
take  /i  =  0  (as  in  the  case  in  our  approximations).   Then  we  are 
concerned  with  finding  P(Z  >  z)  where 


Z  = 


(x'Ax)/cf  .p  ^s 

2  /j:   '  y    •    J 


and  A  is  a  nonnegative  definite  matrix.   The  values  of  c  and  f  are 
given  by 

(D.2) 
f  =  2/cf(x'Ax)//C2(x'Ax) 

where  /c-(x'Ax)  represents  the  i*"  cumulant  of  x'Ax.   In  this  situation, 
these  quantities  become: 
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c  =  tr[(AE)2]/tr(AE) 


f  =[tr(AE)]Vtr[(AE)2], 


Hirotsu's  approximation  is  then  given  by 


P(Z>z)  =P(F^  f  >z) 


+  [V{3(f+2)(f+4)B(|f,if2)}] 


(D.3) 


(l  +  fz/f2)"^^^^^2\fz/f2)^^ 


,   2(f+f2)(f+4)    (f+f2+2)(f+f2)' 


(D.4) 


where  B( •  ,  •  )  is  the  beta  function  and 


|>Ci(x^Ax)k3(x'Ax)   ^ 
«:2(x'Ax) 


{tr(AS)}{tr[(A£)^]}      ^ 
{tr[(AE)2]}2 


(D.5) 


The  approximation  is  based  on  a  generalized  Laguerre  polynomial 
expansion  of  the  true  distribution  of  (x'Ax)/2c  and  has  been  shown  to 
provide  satisfactory  results. 
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